High level in vivo biosynthesis and isolation of water-soluble cannabinoids in stably transformed plant systems

ABSTRACT

The inventive technology relates to systems and methods for enhanced in vivo production, accumulation and modification of cannabinoids. In one embodiment, the invention may include systems and methods for enhanced in vivo biosynthesis of chemically-modified water-soluble cannabinoids in stably transformed whole plant, or a cell suspension culture system.

This application is a Continuation Application of U.S. application Ser. No. 16/110,728, filed Aug. 23, 2018; which claims the benefit of and priority to U.S. Provisional Application Nos. 62/476,080, filed Mar. 24, 2017, and 62/588,662, filed Nov. 20, 2017, and 62/621,166, filed Jan. 21, 2018. The entire specifications and figures of the above-referenced applications are hereby incorporated, in their entirety by reference.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety.

TECHNICAL FIELD

The field of the present invention relates generally to plant molecular biology and plant biotechnology. More specifically, it relates to novel systems, methods and compositions for the in vivo production, modification and isolation of cannabinoid compounds from plant systems, including whole plants and/or plant cell cultures systems. In certain preferred embodiments, the inventive technology includes a novel system of genetically modifying a plant or plant cell suspension culture to produce, modify and/or accumulate one or more target cannabinoids in Cannabis and/or Nicotiana benthamiana and/or Nicotiana tabacum

BACKGROUND

Cannabinoids are a class of specialized compounds synthesized by Cannabis. They are formed by condensation of terpene and phenol precursors. They include these more abundant forms: Delta-9-tetrahydrocannabinol (THC), cannabidiol (CBD), cannabichromene (CBC), and cannabigerol (CBG). Another cannabinoid, cannabinol (CBN), is formed from THC as a degradation product and can be detected in some plant strains. Typically, THC, CBD, CBC, and CBG occur together in different ratios in the various plant strains.

Cannabinoids are generally classified into two types, neutral cannabinoids and cannabinoid acids, based on whether they contain a carboxyl group or not. It is known that, in fresh plants, the concentrations of neutral cannabinoids are much lower than those of cannabinoid acids. One strain Cannabis sativa contains approximately 61 compounds belonging to the general class of cannabinoids. These cannabinoids are generally lipophilic, nitrogen-free, mostly phenolic compounds, and are derived biogenetically from a monoterpene and phenol, the acid cannabinoids from a monoterpene and phenol carboxylic acid, and have a C21 to base material.

Cannabinoids also find their corresponding carboxylic acids in plant products. In general, the carboxylic acids have the function of a biosynthetic precursor. For example, these compounds arise in vivo from the THC carboxylic acids by decarboxylation the tetrahydrocannabinols Δ9- and Δ8-THC and CBD from the associated cannabidiol. As generally shown in FIG. 28 , THC and CBD may be derived artificially from their acidic precursor's tetrahydrocannabinolic acid (THCA) and cannabidiolic acid (CBDA) by non-enzymatic decarboxylation.

Cannabinoids are widely consumed, in a variety of forms around the world. Cannabinoid-rich preparations of Cannabis, either in herb (i.e. marijuana) or resin form (i.e., hash oil), are used by an estimated 2.6-5.0% of the world population (UNODC, 2012). Cannabinoid containing pharmaceutical products, either containing natural cannabis extracts (Sativex®) or the synthetic cannabinoids dronabinol or nabilone, are available for medical use in several countries

As noted above, Δ-9-tetrahydrocannabinol (also known as THC) is one of the main biologically active components in the Cannabis plant which has been approved by the Food and Drug Administration (FDA) for the control of nausea and vomiting associated with chemotherapy and, more recently, for appetite stimulation of AIDS patients suffering from wasting syndrome. The drug, however, shows other biological activities which lend themselves to possible therapeutic applications, such as in the treatment of glaucoma, migraine headaches, spasticity, anxiety, and as an analgesic.

Indeed, it is well documented that agents, such as cannabinoids and endocannabinoids that activate cannabinoid receptors in the body modulate appetite, and alleviate nausea, vomiting, and pain (Martin B. R. and Wiley, J. L, Mechanism of action of cannabinoids: how it may lead to treatment of cachexia, emesis and pain, Journal of Supportive Oncology 2: 1-10, 2004), multiple sclerosis (Pertwee, R. G., Cannabinoids and multiple sclerosis, Pharmacol. Ther. 95, 165-174, 2002), and epilepsy (Wallace, M. J., Blair, R. E., Falenski, K. W W., Martin, B. R., and DeLorenzo, R. J. Journal Pharmacology and Experimental Therapeutics, 307: 129-137, 2003). In addition, CB2 receptor agonists have been shown to be effective in treating pain (Clayton N., Marshall F. H., Bountra C., O'Shaughnessy C. T., 2002. CB1 and CB2 cannabinoid receptors are implicated in inflammatory pain. 96, 253-260; Malan T. P., Ibrahim M. M., Vanderah T. W., Makriyannis A., Porreca F., 2002. Inhibition of pain responses by activation of CB(2) cannabinoid receptors. Chemistry and Physics of Lipids 121, 191-200; Malan T. P., Jr., Ibrahim M. M., Deng H., Liu Q., Mata H. P., Vanderah T., Porreca F., Makriyannis A., 2001. CB2 cannabinoid receptor-mediated peripheral antinociception. 93, 239-245.; Quartilho A., Mata H. P., Ibrahim M. M., Vanderah T. W., Porreca F., Makriyannis A., Malan T. P., Jr., 2003. Inhibition of inflammatory hyperalgesia by activation of peripheral CB2 cannabinoid receptors. Anesthesiology 99, 955-960) and multiple sclerosis (Pertwee, R. G., Cannabinoids and multiple sclerosis, Pharmacol. Ther. 95, 165-174, 2002) in animal models.

More recently, several states have approved use of Cannabis and cannabinoid infused products for both recreational and medical uses. As these new medical and commercial markets have developed, there has grown a need to develop more efficient production and isolation of cannabinoid compounds. Traditional methods of cannabinoid production typically focus on extraction and purification of cannabinoids from raw harvested Cannabis. However, traditional cannabinoid extraction and purification methods have a number of technical and practical problems that limits its usefulness.

Limitations of Traditional Cannabinoid Production and Extraction Methods

For example, in U.S. Pat. No. 6,403,126 (Webster et al.), cannabinoids, and other related compounds are isolated from raw harvested Cannabis and treated with an organic solvent, typically a petroleum derived hydrocarbon, or a low molecular-weight alcohol to solubilize the cannabinoids for later isolation. This traditional method is limited in that it relies on naturally grown plant matter that may have been exposed to various toxic pesticides, herbicides and the like. In addition, such traditional extraction methods are imprecise resulting in unreliable and varied concentrations of extracted THC. In addition, many Cannabis strains are grown in hydroponic environments which are also not regulated and can results in the widespread contamination of such strains with chemical and other undesired compounds.

In another example, US Pat. App. No. 20160326130 (Lekhram et al.), cannabinoids, and other related compounds are isolated from raw harvested Cannabis using, again, a series of organic solvents to convert the cannabanoids into a salt, and then back to its original carboxylic acid form. Similar to Webster, this traditional method is limited in that is relies on naturally grown plant matter that may have been exposed to various toxic pesticides, herbicides and the like. In addition, the multiple organic solvents used in this traditional process must be recovered and either recycled and/or properly disposed of Another traditional method of cannabinoid extraction involves the generation of hash oils utilizing supercritical carbon-dioxide (sCO₂). Under this traditional method, again the dried plant matter is ground and subjected to a sCO₂ extraction environment. The primary extract being initially obtained and further separated. For example, as generally described by CA2424356 (Muller et al.) cannabinoids are extracted with the aid of sCO₂ under supercritical pressure and temperature conditions and by the addition of accessory solvents (modifiers) such as alcohols. Under this process, this supercritical CO₂ evaporates and dissolves into the cannabinoids. However, this traditional process also has certain limiting disadvantages. For example, due to the low solubility in supercritical sCO₂, recovery of the cannabinoids of interest is inconsistent. Additionally, any solvents used must be recycled and pumped back to the extractor, in order to minimize operating costs.

Another method utilizes butane to extract cannabinoids, in particular high concentrations of THC, from raw harvested Cannabis. Because butane is non-polar, this process does not extract water soluble by-products such as chlorophyll and plant alkaloids. That said, this process may take up to 48 hours and as such is limited in its ability to scale-up for maximum commercial viability. The other major drawback of traditional butane-based extraction processes is the potential dangers of using flammable solvents, as well as the need to ensure all of the butane is fully removed from the extracted cannabinoids.

Another limiting factor in the viability of these traditional methods of cannabinoid extraction methods is the inability to maintain Cannabis strain integrity. For example, cannabanoids used in medical and research applications, or that are subject to controlled clinical trials, are tightly regulated by various government agencies in the United States and elsewhere. These regulatory agencies require that the Cannabis strains remain chemically consistent over time. Unfortunately, the genetic/chemical compositions of the Cannabis strains change over generations such that they cannot satisfy regulatory mandates present in most clinical trials or certified for use in other pharmaceutical applications.

Several attempts have been made to address these concerns. For example, efforts have been made to produce cannabinoids in genetically engineered organisms. For example, in U.S. patent application Ser. No. 14/795,816 (Poulos, et al.) Here, the applicant claims to have generated a genetically modified strain of yeast capable of producing a cannabinoid by inserting genes that produce the appropriate enzymes for its metabolic production. However, such application is limited in its ability to produce only a single or very limited number of cannabinoid compounds. This limitation is clinically significant. Recent clinical studies have found that the use of a single isolated cannabinoid as a therapeutic agent is not as effective as treatment with the naturally-occurring “entourage” of primary and secondary cannabinoids associated with various select strains.

Additional attempts have been made to chemically synthesize cannabinoids, such as THC. However, the chemical synthesis of various cannabinoids is a costly process when compared to the extraction of cannabinoids from naturally occurring plants. The chemical synthesis of cannabinoids also involves the use of chemicals that are not environmentally friendly, which can be considered as an additional cost to their production. Furthermore, the synthetic chemical production of various cannabinoids has been classified as less pharmacologically active as those extracted from plants such as Cannabis sativa.

Efforts to generate large-scale Cannabis cell cultures have also raised a number of technical problems. Chief among them is the fact that cannabinoids are cytotoxic. Under natural conditions cannabinoids are generated and then stored extracellularly in small glandular structures called trichomes. Trichomes can be visualized as small hairs or other outgrowths from the epidermis of a Cannabis plant. As a result, in Cannabis cell cultures, the inability to store cannabanoids extracellularly means any accumulation of cannabinoids would be toxic to the cultured cells. Such limitations impair the ability of Cannabis cell cultures to be scaled-up for industrial levels of production.

Cannabinoid Biosynthesis Toxicity Limits In Vivo Production Systems

Efforts to generate Cannabis strains/cell cultures that produce or accumulate high-levels of cannabinoids have raised a number of technical problems. Chief among them is the fact that cannabinoid synthesis produces toxic by-products. Notably, both CBDA and THCA synthases require molecular oxygen, in conjunction with a molecule of FAD, to oxidize Cannabigerolic acid (CBGA). Specifically, as shown in FIG. 29 , two electrons from the substrate are accepted by an enzyme-bound FAD, and then transferred to molecular oxygen to re-oxidize FAD. CBDA and THCA are synthesized from the ionic intermediates via stereoselective cyclization by the enzymes. The hydride ion is transferred from the reduced flavin to molecular oxygen, resulting in the formation of hydrogen peroxide and re-activation of the flavin for the next cycle. As a result, in addition to producing CBDA and THCA respectively, this reaction produces hydrogen peroxide (H₂O₂) which is naturally toxic to the host cell. Due to this production of a toxic hydrogen peroxide byproduct, cannabinoid synthesis generates a self-limiting feed-back loop preventing high-level production and/or accumulation of cannabinoids in in vivo systems. One way that Cannabis plants deal with these cellular cytotoxic effects is through the use of trichomes for Cannabinoid production and accumulations.

Cannabis plants deal with this toxicity by sequestering cannabinoid biosynthesis and storage extracellularly in small glandular structures called trichomes as note above. For example, THCA synthase is a water soluble enzyme that is responsible for the production of THC. For example, THC biosynthesis occurs in glandular trichomes and begins with condensation of geranyl pyrophosphate with olivetolic acid to produce cannabigerolic acid (CBGA); the reaction is catalyzed by an enzyme called geranylpyrophosphate:olivatolate geranyltransferase. CBGA then undergoes oxidative cyclization to generate tetrahydrocannabinolic acid (THCA) in the presence of THCA synthase. THCA is then transformed into THC by non-enzymatic decarboxylation. Sub-cellular localization studies using RT-PCR and enzymatic activity analyses demonstrate that THCA synthase is expressed in the secretory cells of glandular trichomes, and then is translocated into the secretory cavity where the end product THCA accumulates. THCA synthase present in the secretory cavity is functional, indicating that the storage cavity is the site for THCA biosynthesis and storage. In this way, the Cannabis is able to produce cannabinoids extracellularly and thereby avoid the cytotoxic effects of these compounds. However, as a result, the ability to access and chemically alter cannabinoids in vivo is impeded by this cellular compartmentalization.

To address these concerns, some have proposed chemically modifying cannabinoid compounds to reduce their cytotoxic effects. For example, Zipp, et al. have proposed utilizing an in vitro method to produce cannabinoid glycosides. However, this application is limited to in vitro systems only. Specifically, as noted above, cannabinoid synthase enzymes, such as THCA synthase, are water soluble proteins that are exported out of the basal trichome cells into the storage compartment where it is active and catalyzes the synthesis of THCA. Specifically, in order to effectively mediate the cellular export of such cannabinoid synthase, this enzyme contains a 28 amino acid signal peptide that directs its export out of the cell and into the extracellular trichrome where cannabinoid synthesis occurs. As a result of this signal-dependent extracellular compartmentalization of, in this instance, THCA synthase, this means that the THCA is made outside of the cytoplasm and would not be accessible to genetically engineered glycosylation enzymes. As such, simple expression of a UDP glycosyltransferase in plant cells, as vaguely alluded to in Zipp, et al., would not result in effective glycosylation of cannabinoid molecules in the compartmentalized and extracellular trichrome structure where cannabinoid synthesis occurs. Neither can the method of Zipp generate acetylated cannabinoids, as well as 0 acetyl glycoside cannabinoid molecules.

The foregoing problems regarding the production, detoxification and isolation of cannabinoids may represent a long-felt need for an effective—and economical—solution to the same. While implementing elements may have been available, actual attempts to meet this need may have been lacking to some degree. This may have been due to a failure of those having ordinary skill in the art to fully appreciate or understand the nature of the problems and challenges involved. As a result of this lack of understanding, attempts to meet these long-felt needs may have failed to effectively solve one or more of the problems or challenges here identified. These attempts may even have led away from the technical directions taken by the present inventive technology and may even result in the achievements of the present inventive technology being considered to some degree an unexpected result of the approach taken by some in the field.

As will be discussed in more detail below, the current inventive technology overcomes the limitations of traditional cannabinoid production systems while meeting the objectives of a truly effective and scalable cannabinoid production, modification and isolation system.

SUMMARY OF THE INVENTION(S)

The inventive technology may encompass systems, methods and compositions for the in vivo production, modification and isolation of cannabinoid compounds from Cannabis plants. In particular, the invention provides systems and methods for high level in vivo biosynthesis of water-soluble cannabinoids.

The current inventive technology includes systems and methods for enhanced production and/or accumulation of cannabinoids. In one embodiment, the invention may include systems and methods for enhanced production and/or accumulation of cannabinoids in an in vivo system, such as a plant, or plant cell culture.

Another aim of the current invention may include the generation of genetically modified plants overexpressing certain endogenous/exogenous genes that result in the over-production and/or accumulation of cannabinoids above wild-type levels. In one preferred embodiment, such transgenic plants may exhibit enhanced production and localized accumulation of cannabinoid precursor compounds, such as THCA (tetrahydrocannabinolic acid), CBCA (cannabichromenic acid), and CBDA (cannabidiolic acid). Such transgenic plants may additionally exhibit enhanced production and localized accumulation of cannabinoids, such as THCs, CBCs and CBDs. An additional aim of the current invention may include the generation of genetically modified plants expressing certain endogenous/exogenous that result in the enhanced modification of cannabinoids. In one preferred embodiment, such transgenic plants may exhibit enhanced modification of cannabinoids including hydroxylation, and/or acetylation, and/or glycosylation. In additional preferred embodiments, such transgenic plants may exhibit enhanced modification of cannabinoids including acetylation and glycosylation, such as an O acetyl glycoside form. For example, acetylation adds an acetyl group (—CH₃OOH) to a cannabinoid such that the carboxylate group is acidic and charged at neutral pH making it highly water-soluble.

One aim of the current inventive technology may be to generate a genetically modified or transgenic Cannabis plant that overexpresses one or more transcription factors, such as myb, that enhance metabolite flux through the cannabinoid biosynthetic pathway. In one preferred embodiment, these transcription factors may include various analogues. In certain preferred embodiment, one or more of these transgenes may be operably-linked to one or more promoters.

Another aim of the current inventive technology may be to generate a genetically modified or transgenic Cannabis cell culture that overexpresses one or more transcription factors that enhance metabolite flux through the cannabinoid biosynthetic pathway. In one preferred embodiment, these transgenes may be operably linked to one or more promoters.

Another aim of the current inventive technology may be to generate a genetically modified or transgenic Cannabis plant that expresses one or more exogenous/heterologous transcription factors that up-regulated trichome formation to increase cannabinoid accumulation. In certain preferred embodiments, one or more of these exogenous transgenes may be operably linked to one or more promoters.

Yet, another aim of the current inventive technology may be to generate a genetically modified or transgenic Cannabis plant that expresses an enzyme that is configured to be capable of reducing hydrogen peroxide (H₂O₂) levels that may be generated during cannabinoid synthesis. In one preferred embodiment, the current inventive technology may be to generate a genetically modified or transgenic Cannabis plant that expresses a chimeric protein. In this embodiment, this chimera protein may include a first domain that may reduce hydrogen peroxide (H₂O₂) levels that may be generated during cannabinoid synthesis. This chimera/fusion protein may further include a second domain that may comprise a trichome targeting domain that may allow targeted localization of the chimeric protein to locations of active cannabinoid synthesis. In some embodiments, a third domain may include a linker which may further separate the first domain from the second domain, such that said first domain and said second domain can each fold into its appropriate three-dimensional shape and retains its activity and said linker ranges in length.

Another aim of the current inventive technology may include the generation of one or more of the above referenced genetically modified plant or plant cell cultures utilizing Agrobacterium Ti-plasmid mediated transformation.

Another aim of the present inventive technology relates methods and systems for the in vivo cellular localization of cannabinoid biosynthesis and modification. More specifically, the present inventive technology relates methods and systems for the in vivo cellular localization of cannabinoid hydroxylation, acetylation and/or glycosylation. The inventive technology may include systems and methods for high-efficiency localized chemical modification and isolation of cannabinoid compounds from suspension cultures. In this embodiment, various select cannabinoid compounds may be chemically modified into soluble and non-toxic configurations.

Additional embodiments of the inventive technology may include the transient modification of cannabinoid compounds to reduce and/or eliminate their cytotoxicity in plants or plant cell culture systems. In a preferred embodiment, such transiently modified cannabinoids may be allowed to accumulate at levels that would normally have a deleterious effect on the cell. Additional embodiments may include the isolation of these transiently modified cannabinoids followed by enzymatic conversion or reconstitution to their original and/or partially modified structure.

Another aim of the invention may include the generation of a transgenic plant and or plant cell cultures that may express heterologous genes that coupled cannabinoid synthesis and hydroxylation and/or glycosylation in planta. Specifically, one aim of the technology may include using Nicotiana benthamiana to demonstrate the coupling CBDA synthesis and glycosylation in planta. An, additional aim of this embodiment may include additional modifications in the CBDA molecule, such as hydroxylation and acetylation. In yet another aim, this cannabinoid modification may be specifically localized, for example in the cytosol and/or trichome.

Another aim of the invention may include the generation of a transgenic plant and or plant cell cultures that may over express endogenous genes that may be configured to modify cannabinoids. Additional aim may include the co-expression of heterologous transcription factors that may increase cannabinoid production. Another aim of the invention may include the co-expression of heterologous genes that detoxify the hydrogen peroxide byproducts generated through cannabinoid biosynthesis. Co-expression of such genes may be additive with the co-expression of genes configured to modify and/or localize cannabinoid biomodifications.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 . Representative Chromatographic Elution profile of CBGA Glycosides found in in vitro Assays. Chromatograms A, B, and C represent respective extracted ion chromatograms for each glycoside product. Chromatogram D is representative of the total ion chromatogram. Peak Intensities are illustrated as relative abundance to most abundant peak in each respective chromatogram.

FIG. 2 . Representative Chromatographic Elution profiles of Functionalized CBGA and Glycosides found in in vitro assays. Chromatograms A, B, and C represent respective extract rated ion chromatograms for each product. Chromatogram D is representative of the total ion chromatogram. Peak Intensities are illustrated as relative abundance to most abundant peak in each respective chromatogram.

FIG. 3 . Representative Chromatographic Elution profile of CBDA Glycosides profiles found in Leaf Extracts. Chromatograms A, B, C, and D represent respective extract rated ion chromatograms for each glycoside product. Chromatogram E is representative of the total ion chromatogram. Peak Intensities are illustrated as relative abundance to most abundant peak in each respective chromatogram.

FIG. 4 . Chromatographic Elution of Functionalized CBDA and Functionalized Glycosides in Leaf Extracts. Chromatograms A, B, and C represent respective extract rated ion chromatograms for each product. Chromatogram D is representative of the total ion chromatogram. Peak Intensities are illustrated as relative abundance to most abundant peak in each respective chromatogram.

FIG. 5 . Gene construct for expression of cytochrome P450 (CYP3A4) gene, (SEQ ID NO. 1), expressing the cytochrome P450 (CYP3A4) protein (SEQ ID NO. 2) and P450 oxidoreductase gene (oxred) (SEQ ID NO. 3) expressing the P450 oxidoreductase protein (SEQ ID NO. 4), in plants. Both genes were driven by the constitutive 35S promoter (35S) and featured 5′ untranslated regions from Arabidopsis thaliana alcohol dehydrogenase (AtADH) as translational enhancers.

FIG. 6 . Confirmation of expression of CYP3A4 and P450 oxidoreductase in tobacco leaves. CB1-CB5, biological replicates of leaves infiltrated with the CYP3A4/P450 oxidoreductase; WT=wild type tobacco leaves with no infiltration. L=1 kb plus ladder (Thermo Fisher Scientific, USA). The arrows show the expected (500 bp) band indicating expression of the transgene.

FIG. 7 . Enhanced glycosylation of cannabinoids in P450-over expressing N. benthamiana plants. CB1-CB5 are biological reps overexpressing CYP3A4+P450 oxidoreductase, P_control is the P19 silencing suppressor (‘empty vector’ control). Vertical axis shows relative amounts expressed as peak area per g fresh weight.

FIG. 8 . Gene construct for the cytosol and suspension culture cannabinoid production system. 35S, Cauliflower mosaic 35S promoter; HSPt, HSP terminator; 35PPDK, hybrid promoter consisting of the cauliflower mosaic virus 35S enhancer fused to the maize C4PPDK basal promoter (Yoo et al. 2007); 76G1, UDP glycosyltransferase from Stevia rebaudiana; ABCG2, human multi-drug transporter.

FIG. 9 . Demonstrates RT-PCR confirmation of expression of CBDA synthase (a), UDP glycosyltransferase (b) and ABCG2 (c) in tobacco leaf cells. L is the 1 kb plus ladder (Thermo Fisher Scientific, USA). Numbers on the lanes represent independent transgenic lines. The arrows point to the expected band that shows expression of the transgene.

FIG. 10 . Hydroxylation and glycosylation of cannabinoids in transgenic tobacco (SUS, numbered) overexpressing CBDA synthase, UDP glycosyltransferase and ABC transporter. WTS1 and 2 are wild type fed with substrate for endogenous reactions. There was some endogenous glycosylation of CBGA, as well as evidence for enhanced transgenic glycosyltransferase activity (e.g. SUS2, SUS3 and SUS4). The data has been corrected to peak area per g fresh weight.

FIG. 11 . Enhanced modification of cannabinoids in transgenic N. benthamiana plants co-infected with constructs for glycosylation, P450-mediated functionalization (hydroxylation) and detoxification of hydrogen peroxide by catalase. SUS=construct for overexpressing CBDA synthase, UDP glycosyltransferase and ABC transporter; M3 S=construct for overexpressing CBDA synthase, UDP glycosyltransferase and ABC transporter with Cannabis MYB12-like and Arabidopsis thaliana catalase.

FIG. 12 . Increased glycosylation activity in transgenic N. benthamiana plants (TSA, TSB, TSC, SUS, SUS/P450) overexpressing a glycosyltransferase compared to wild type in 14-hour transient expression assays.

FIG. 13 . Exemplary monooxygenase reaction, catalyzed by cytochromes P450.

FIG. 14 . Gene construct 1 for the trichome cannabinoid production system. Cauliflower mosaic 35S promoter; AtADH 5′-UTR, translation enhancer element (Matsui et al. 2012); tsCBDAs, cannabidiolic acid synthase with its original trichome target sequence; HSP terminator; tsUGT76G1, UDP glycosyltransferase from Stevia rebaudiana with CBDAs trichome target sequence.

FIG. 15 . Gene construct 2 for the trichome cannabinoid production system. Cauliflower mosaic 35S promoter; AtADH 5′-UTR, enhancer element; PM-UTR1, Arabidopsis thaliana UDP-glucose/galactose transporter targeted to the plasma membrane; HSP terminator.

FIG. 16 . Trichome-targeted CBDA synthase RT-PCR (top), Trichome-targeted UDP glycosyltransferase (76G1) UGT RT-PCR (bottom). A, B, and C are biological replicates collected after 2 DPI.

FIG. 17 . PM-UTR1 RT-PCR. A, B, and C are biological replicates collected after 2 DPI.

FIG. 18 . Gene construct for the cytosolic cannabinoid production system. Cauliflower mosaic 35S promoter; AtADH 5′-UTR, enhancer element; cytCBDAs, cannabidiolic acid synthase with the trichome target sequence removed; HSP terminator; cytUGT76G1, UDP glycosyltransferase from Stevia rebaudiana.

FIG. 19 . SUS-A to SUS-C are biological replicates for the cell suspension (201-SUS) transformation after 1 DPI.

FIG. 20 . cytUGT RT-PCR (top), cytCBDAs RT-PCR (bottom). A, B, and C are biological replicates for cytosolic construct infiltration after 2 DPI.

FIG. 21 . Cannabinoid detection in leaves infiltrated with trichome or cell suspension constructs and fed with CBGA 2.7 mM. The color code refers to the target compartment for CBDAs and UGT76G1 protein accumulation, either trichome or cell suspension cytostol. Y-axis: CBGA and CBDA expressed as parts per million (ppm). Primary, secondary, and acylated glycosides expressed as peak area.

FIG. 22 . Cannabinoid detection in leaves infiltrated with cytosolic or cell suspension construct and fed with CBGA 2.7 mM and UDP-glucose 4 mM. The color code refers to the target compartment for CBDAs and UGT76G1 protein accumulation. Y-axis: CBGA expressed as parts per million (ppm). All other cannabinoid derivatives expressed as peak area (no standards available).

FIG. 23 . Extracted Ion Chromatograms of R—OH Functionalized 1× Glycosylated CBDA Analog. (A) Chromatographic trace, ion m/z, calculated elemental composition, confirming presence of trace levels of CBDA analog (B) Absence of CBDA analog in control extract (C) Absence of CBDA analog in biological duplicate control extract.

FIG. 24 . Direct Infusion Mass Spectrum of Cannabis sativa extract. Spectral insets represent CBDA with a single glycosylation (519.2546 m/z), and CBDA functionalized with R—OH and a single glycosylation (535.2543 m/z). Peak Intensities are illustrated as relative abundance to most intense ion.

FIG. 25 . Relative abundance of CBDA in extracts of various Cannabis sativa strains infiltrated with Agrobacterium cultures harboring CBDA synthase (CBDAs) and UGT plasmid combinations. Normalized relative abundance data is presented as the ion intensity of each compound divided by the ion intensity of the internal standard 7-hydroxycoumarin (20 ppm).

FIG. 26 . Relative abundance of modified CBDA (glycosylated and/or hydroxylated) in extracts of various Cannabis sativa strains infiltrated with Agrobacterium cultures harboring CBDAs and UGT plasmid combinations. Normalized relative abundance data is presented as the ion intensity of each compound divided by the ion intensity of the internal standard 7-hydroxycoumarin (20 ppm).

FIG. 27 . Gene construct used to boost cannabinoid production and mitigate toxicity. CsMYB12, predicted Cannabis sativa MYB transcription factor for enhancing flavonol biosynthesis; HSPt, efficient transcription terminator from the Arabidopsis thaliana heat shock protein 18.2 gene; 35S, constitutive promoter from cauliflower mosaic virus; Catalase, Arabidopsis thaliana catalase gene.

FIG. 28 . Synthesis of THC and CBD from common precursor CBGA.

FIG. 29 . Generation of hydrogen peroxide during cannabinoid biosynthesis.

FIG. 30 . Hydroxylation followed by oxidation of THC by CYP2C9/FIG. 31 . Transfer of a glucuronic acid component to a cannabinoid substrate by UGT.

FIG. 32 . (A) Synthesis Olivetolic Acid a precursor of CBGA; (B) by-product compounds.

FIG. 33 . Amino Acid sequence comparison of exemplary Arabidopsis catalase protein sequences.

FIG. 34 . Schematic diagram of increase cannabinoid production coupled with reduced oxidative damage system in one embodiment thereof

MODE(S) FOR CARRYING OUT THE INVENTION(S)

The present invention includes a variety of aspects, which may be combined in different ways. The following descriptions are provided to list elements and describe some of the embodiments of the present invention. These elements are listed with initial embodiments, however it should be understood that they may be combined in any manner and in any number to create additional embodiments. The variously described examples and preferred embodiments should not be construed to limit the present invention to only the explicitly described systems, techniques, and applications. Further, this description should be understood to support and encompass descriptions and claims of all the various embodiments, systems, techniques, methods, devices, and applications with any number of the disclosed elements, with each element alone, and also with any and all various permutations and combinations of all elements in this or any subsequent application.

The inventive technology includes systems and methods for high-level production of cannabinoid compounds. As used herein, the term “high level” in this instance may mean higher than wild-type biosynthesis or accumulation of one or more cannabinoids in a plant or plant cell. In one embodiment, a suspension or hairy root or cell suspension culture of one or more plant strains may be established. In one preferred embodiment, a suspension or hairy root or cell suspension culture of one or more Cannabis or tobacco plant strains may be established. It should be noted that the term strain may refer to a plant strain, as well as a cell culture, or cell line derived from a plant, such as Cannabis.

In one preferred embodiment, a suspension or hairy root or cell suspension culture of Cannabis sativa or tobacco plant may be established in a fermenter or other similar apparatus. It should be noted that the use of C. sativa in this embodiment is exemplary only. For example, in certain other embodiments, various Cannabis strains, mixes of strains, hybrids of different strains or clones, as well as different varieties may be used to generate a suspension or hairy root culture. For example, strains such as C. sativa, C. indica and C. ruderalis may all be used with the inventive technology. In yet further embodiments, other cannabinoid or cannabinoid-like producing plants may be used. For example, in a certain embodiment a cell suspension or hairy root culture may be established for one or more of the following: Echinacea; Acmella oleracea; Helichrysum umbraculigerum; Radula marginata (Liverwort), Theobroma cacao or tobacco.

In certain embodiments, such fermenters may include large industrial-scale fermenters allowing for a large quantity of cannabinoid producing C. sativa cells to be cultured. In this embodiment, it may be possible to culture a large quantity of unadulterated cells from a single-strain of, for example, tobacco or C. sativa, which may establish a cell culture having a consistent production and/or modification of cannabinoid compounds in both quantity and type. Such cultured growth may be continuously sustained with the supplementation of nutrient and other growth factors to the culture. Such features may be automated or accomplished manually.

Another embodiment of the inventive technology may include systems and methods for high level production of modified cannabinoid compounds. In one embodiment, a suspension or hairy root culture of one or more tobacco plant strains may be established. It should be noted that the term strain may refer to a plant strain, as well as a cell culture, or cell line derived from a tobacco plant. In one preferred embodiment, a suspension or hairy root culture of Nicotiana benthamiana plant may be established in a fermenter or other similar apparatus. It should be noted that the use of N. benthamiana in this embodiment is exemplary only. For example, in certain other embodiments, various Nicotiana strains, mixes of strains, hybrids of different strains or clones, as well as different varieties may be used to generate a cell suspension or hairy root culture.

In certain cases, such fermenters may include large industrial-scale fermenters allowing for a large quantity of N. benthamiana cells to be cultured. In this embodiment, harvested cannabinoids may be introduced to this suspension culture, and modified as generally described herein. Similarly, such cultured growth of tobacco cells may be continuously sustained with the continual addition of nutrient and other growth factors being added to the culture. Such features may be automated or accomplished manually.

Another embodiment of the invention may include the production of genetically modified Cannabis and/or tobacco cells to express varying exogenous and/or endogenous genes that may modify the chemical structure of cannabinoid compounds. Such transgenic strains may be configured to produce and/or modify large quantities of cannabinoid compounds generally, as well as targeted increases in the production of specific cannabanoids such as THC, Cannabidiol (CBD) or Cannabinol (CBN) and the like.

Another embodiment of the invention may include the production of genetically modified Cannabis cell cultures that express a mix of cannabinoids that may be optimized for the treatment of specific medical conditions. For example, CBD is a non-psychoactive cannabinoid that may be used to treat seizures in those with epilepsy. However, decades of selective breeding has resulted in the majority of Cannabis strains having low concentrations of CBD when compared to the psychoactive cannabinoid THC. As such, in certain embodiments, disease or syndrome specific cell cultures may be developed that express a calibrated mix of cannabinoids for the downstream treatment of such conditions.

Additional embodiments of the inventive technology may include novel systems, methods and compositions for the production and in vivo modification of cannabinoid compounds in a plant system. In certain embodiment, these in vivo modifications may lead to the production of different forms of cannabinoids with special properties, e.g. water-soluble, slow-release cannabinoids or prodrugs. In one preferred embodiment, the inventive technology may include novel systems, methods and compositions for the hydroxylation, acetylation and/or glycosylation. Modified cannabinoids can be made water-soluble, for example by glycosylation.

As noted above, production and/or accumulation of high-levels of cannabinoids would be toxic for a plant cell host. As such, one embodiment of the inventive technology may include systems and methods to transiently modify cannabinoids in vivo. One aim of the current invention may include the use of cytochrome P450's (CYP) monooxygenases to transiently modify or functionalize the chemical structure of the cannabinoids. CYPs constitute a major enzyme family capable of catalyzing the oxidative biotransformation of many pharmacologically active chemical compounds and other lipophilic xenobiotics. For example, as shown in FIG. 13 , the most common reaction catalyzed by cytochromes P450 is a monooxygenase reaction, e.g., insertion of one atom of oxygen into the aliphatic position of an organic substrate (RH) while the other oxygen atom is reduced to water.

Several cannabinoids, including THC, have been shown to serve as a substrate for human CYPs (CYP2C9 and CYP3A4). Similarly, CYPs have been identified that metabolize cannabidiol (CYPs 2C19, 3A4); cannabinol (CYPs 2C9, 3A4); JWH-018 (CYPs 1A2, 2C9); and AM2201 (CYPs 1A2, 2C9). For example, as shown generally in FIG. 30 , in one exemplary system, CYP2C9 may “functionalize” or hydroxylate a THC molecule resulting in a hydroxyl-form of THC. Further oxidation of the hydroxyl form of THC by CYP2C9 may convert it into a carboxylic-acid form which loses its psychoactive capabilities, rendering it an inactive metabolite.

As such, another embodiment of the invention may include the creation of a Cannabis strain or cell culture that may be transformed with artificially created genetic constructs encoding one or more exogenous CYPs. In one preferred embodiment, genes encoding one or more non-human isoforms and/or analogs, as well as possibly other CYPs that may functionalize cannabinoids, may be expressed in transgenic Cannabis sativa or other plant. In another preferred embodiment, genes encoding one or more non-human isoforms and/or analogs, as well as possibly other CYPs that may functionalize cannabinoids, may be expressed in transgenic Cannabis sativa or tobacco strains grown in a suspension culture. Additional embodiments may include genetic control elements such as promotors and/or enhancers as well as post-transcriptional regulatory elements that may also be expressed in transgenic Cannabis strains such that the presence, quantity and activity of any CYPs present in the suspension or hairy root culture may be modified and/or calibrated.

Another embodiment of the invention may include the creation of a tobacco strain or cell culture may be transformed with artificially created genetic constructs encoding one or more exogenous CYPs. In one preferred embodiment, genes encoding one or more non-human isoforms and/or analogs, as well as possibly other CYPs that may functionalize cannabinoids introduced to a transgenic N. benthamiana plant or suspension culture. Additional embodiments may include genetic control elements such as promotors and/or enhancers as well as post-transcriptional regulatory elements that may also be expressed in transgenic N. benthamiana strains such that the presence, quantity and activity of any CYPs present in the suspension or hairy root culture may be modified and/or calibrated.

Another aim of the invention may be to further modify, in vivo, cannabinoids and/or already functionalized cannabinoids. In a preferred embodiment, glycosylation of cannabinoids and/or functionalized cannabinoids may covert to them into a water-soluble form. In an exemplary embodiment shown in FIG. 31 , the inventive technology may utilize one or more glycosyltransferase enzymes, such as UDP-glycosyltransferase (UGT), to catalyze, in vivo the glucuronosylation or glucuronidation of cannabinoids, such as primary (CBD, CBN) and secondary cannabinoids (THC, JWH-018, JWH-073). In this embodiment, glucuronidation may consist of the transfer of a glucuronic acid component of uridine diphosphate glucuronic acid to a cannabinoid substrate by any of several types of glycosyltransferases as described herein. Glucuronic acid is a sugar acid derived from glucose, with its sixth carbon atom oxidized to a carboxylic acid.

Yet another embodiment of the current invention may include the in vivo conversion of a functionalized cannabinoid, in this example a carboxylic acid form of the cannabinoid, to a glycosylated form of cannabinoid that may be both water-soluble and non-toxic to the cell host. These chemical modifications may allow for greater levels of cannabinoid accumulation in a plant cell culture without the deleterious cytotoxic effects that would be seen with unmodified cannabinoids due to this water-solubility.

Another embodiment of the invention may include the generation of transgenic or genetically modified strains of Cannabis, or other plants such as tobacco, having artificial genetic constructs that may express one or more genes that may increase cannabinoids solubility and/or decrease cannabinoid cytotoxicity. For example, the inventive technology may include the generation of transgenic plant strains or cell lines having artificial genetic constructs that may express one or more endogenous/or exogenous glycosyltransferases or other enzymes capable of glycosylating cannabinoid compounds. For example, in one embodiment one or more glycosyltransferases from N. benthamiana, or other non-cannabis plants may be introduced into a cannabis plant or cell culture and configured to glycosylate cannabinoids in vivo. In other embodiment, endogenous glycosyltransferases from N. benthamiana may be over-expressed to as to increase in vivo cannabinoid glycosylation.

In an additional embodiment, of the inventive technology may include the generation of artificial genetic constructs having genes encoding one or more glycosyltransferases, including non-human analogues of those described herein as well as other isoforms, that may further may be expressed in transgenic Cannabis sativa, N. benthamiana or other plant system which may further be grown in a suspension culture. Additional embodiments may include genetic control elements such as promotors and/or enhancers as well as post-transcriptional regulatory control elements that may also be expressed in a transgenic plant system such that the presence, quantity and activity of any glycosyltransferases present in the suspension or hairy root culture may be regulated.

An additional embodiment of the invention may include artificial genetic constructs having one or more genes encoding one or more UDP- and/or ADP-glycosyltransferases having localization sequences or domains that may assist in the movement of the protein to a certain portion of the cell, such as the cellular locations were cannabinoids and/or functionalized cannabinoids may be modified, produced, stored, and/or excreted from the cell.

An additional embodiment of the invention may include artificial genetic constructs having one or more genes encoding one or more UDP- and/or ADP-glycosyltransferases being co-expressed with one or more exogenous genes that may assist in the movement of the protein to a certain portion of the cell, such as the cellular locations were cannabinoids and/or functionalized cannabinoids may be stored, and/or excreted from the cell.

One preferred embodiment of the inventive technology may include the high level in vivo production of water-soluble, glycosylated cannabinoids, generally being referred to as transiently modified cannabinoids that may be harvested from a plant or a cell culture. In one embodiment, transiently modified cannabinoids may accumulate within the cell that is part of a suspension culture. In this example, the cell culture may be allowed to grow to a desired level of cell or optical density, or in other instances until a desired level of transiently modified cannabinoids have accumulated in the cultured Cannabis cells. Such exogenous genes may be localized, for example to the cytosol or trichome as generally described herein, and may further be co-expressed with other exogenous genes that may reduce cannabinoid biosynthesis toxicity and/or facilitate cannabinoid transport through, or out of the cell.

All or a portion of the Cannabis cells containing the accumulated transiently modified cannabinoids may then be harvested from the culture, which in a preferred embodiment may be an industrial-scale fermenter or other apparatus suitable for the large-scale culturing of plant cells. The harvested Cannabis cells may be lysed such that the accumulated transiently modified cannabinoids may be released to the surrounding lysate. Additional steps may include treating this lysate. Examples of such treatment may include filtering or screening this lysate to remove extraneous plant material as well as chemical treatments to improve later cannabinoid yields.

Another embodiment of inventive technology may include the high level in vivo generation of water-soluble, glycosylated cannabinoids, generally being referred to as transiently modified cannabinoids that may be harvested from a plant or a cell culture. In one embodiment, cannabinoids may be introduced to a non-cannabinoid producing cell culture, such as N. benthamiana. In this preferred embodiment, the non-cannabinoid producing cell culture may be genetically modified to express one or more endogenous or exogenous genes that may modify the cannabinoids, for example through hydroxylation, acetylation and/or glycosylation. Such endogenous or exogenous genes may be localized, for example to the cytosol or trichome as generally described herein, and may further be co-expressed with other exogenous genes that may reduce cannabinoid biosynthesis toxicity and/or facilitate cannabinoid transport through, or out of the cell.

This non-cannabinoid producing the cell culture may be allowed to grow to a desired level of cell or optical density, or in other instances until a desired level of transiently modified cannabinoids have accumulated in the cultured cells. All or a portion of the N. benthamiana cells containing the accumulated cannabinoids may then be harvested from the culture, which in a preferred embodiment may be an industrial-scale fermenter or other apparatus suitable for the large-scale culturing of plant cells. The harvested N. benthamiana cells may be lysed such that the accumulated transiently modified cannabinoids may be released to the surrounding lysate. Additional steps may include treating this lysate. Examples of such treatment may include filtering or screening this lysate to remove extraneous plant material as well as chemical treatments to improve later cannabinoid yields.

Another aim of the inventive technology may include methods to isolate and purified transiently modified cannabinoids from a plant or suspension culture. In one preferred embodiment, a Cannabis lysate may be generated and processed utilizing affinity chromatography or other purification methods. In this preferred embodiment, an affinity column having a ligand or protein receptor configured to bind with the transiently modified cannabinoids, for example through association with a glycosyl or glucuronic acid functional group among others, may be immobilized or coupled to a solid support. The lysate may then be passed over the column such that the transiently modified cannabinoids, having specific binding affinity to the ligand become bound and immobilized. In some embodiments, non-binding and non-specific binding proteins that may have been present in the lysate may be removed. Finally, the transiently modified cannabinoids may be eluted or displaced from the affinity column by, for example, a corresponding sugar or other compound that may displace or disrupt the cannabinoid-ligand bond. The eluted transiently modified cannabinoids may be collected and further purified or processed.

An aim of the invention may include an embodiment where transiently modified cannabinoids may be passively and/or actively excreted from a cell or into a cell wall. In one exemplary model, an exogenous ATP-binding cassette transporter (ABC transporters) or other similar molecular structure may recognize the glycosyl or glucuronic acid functional group (conjugate) on the transiently modified cannabinoid and actively transport it across the cell wall/membrane and into the surrounding media. In this embodiment, the cell culture may be allowed to grow until an output parameter is reached. In one example, an output parameter may include allowing the cell culture to grow until a desired cell/optical density is reach, or a desired concentration of transiently modified cannabinoid is reached. In this embodiment, the culture media containing the transiently modified cannabinoids may be harvested for later cannabinoid extraction. In some embodiments, this harvested media may be treated in a manner similar to the lysate generally described above. Additionally, the transiently modified cannabinoids present in the raw and/or treated media may be isolated and purified, for example, through affinity chromatography in a manner similar to that described above.

In certain embodiments, this purified cannabinoid isolate may contain a mixture of primary and secondary glycosylated cannabanoids. As noted above, such purified glycosylated cannabinoids may be water-soluble and metabolized slower than unmodified cannabinoids providing a slow-release capability that may be desirable in certain pharmaceutical applications, such as for use in tissue-specific applications, or as a prodrug. As such, it is one aim of the invention to incorporate such purified glycosylated cannabinoids into a variety of pharmaceutical and/or nutraceutical applications.

For example, the purified glycosylated cannabinoids may be incorporated into various solid and/or liquid delivery vectors for use in pharmaceutical applications. As noted above, these transiently modified cannabinoids may no longer possess their psychoactive component, making their application in research, therapeutic and pharmaceutical applications especially advantageous. For example, the treatment of children may be accomplished through administration of a therapeutic dose of isolated and purified transiently modified cannabinoids, without the undesired psychoactive effect. Additional therapeutic applications may include the harvesting and later administration of a therapeutic dose of an “entourage” of isolated and purified transiently modified cannabinoids.

Another embodiment of the invention may include a system to convert or reconstitute transiently modified cannabinoids. In one preferred embodiment, glycosylated cannabinoids may be converted into non-glycosylated cannabinoids through their treatment with one or more generalized or specific glycosidases. The use and availability of glycosidase enzymes would be recognized by those in the art without requiring undue experimentation. In this embodiment, these glycosidase enzymes may remove a sugar moiety. Specifically, these glycosidases may remove the glycosyl or glucuronic acid moiety reconstituting the cannabinoid compound to a form exhibiting psychoactive activity. This reconstitution process may generate a highly purified “entourage” of primary and secondary cannabinoids. These reconstituted cannabinoid compounds may also be incorporated into various solid and/or liquid delivery vectors for use in a variety of pharmaceutical and other commercial applications.

As noted above, in one embodiment of the invention, cannabinoid producing strains of Cannabis, as well as other plants may be utilized with the inventive technology. In certain preferred embodiments, in lieu of growing the target cannabinoid producing plant in a cell culture, the raw plant material may be harvested and undergo cannabinoid extraction utilizing one or more of the methods described herein. These traditionally extracted cannabinoids may then be modified from their native forms through the in vitro application of one or more CYP's that may generate hydroxyl and carboxylic acid forms of these cannabinoids respectively. These functionalized cannabinoids may be further modified through the in vitro application of one or more glycosyltransferases as generally described herein. In this embodiment, the new transiently modified cannabinoids may be isolated and purified through a process of affinity chromatography, or other extraction protocol, and then applied to various commercial and other therapeutic uses. In other embodiments, the transiently modified cannabinoids may be restored and reconstituted through the in vitro application of one or more glycosidase enzymes. These restored cannabinoids may also be applied to various commercial and other therapeutic uses.

Another embodiment of the invention may include the use of other non-cannabinoid producing plants in lieu of growing a cannabinoid producing plant in a cell culture. Here, cannabinoid may be introduced to genetically modified plants, or plant cell cultures that express one or more CYP's that may generate hydroxyl and carboxylic acid forms of these cannabinoids respectively. These functionalized cannabinoids may be further modified through the action of one or more glycosidases that may also be expressed in the non-cannabinoid producing plant or cell culture. In one preferred embodiment, a non-cannabinoid producing cell culture may include tobacco plant or cell cultures.

One embodiment of the invention may include an in vivo method of trichome-targeted cannabinoid accumulation and modification. One preferred embodiment of this in vivo system may include the creation of a recombinant protein that may allow the translocation of a CYP or glycosyltransferases to a site of extracellular cannabinoid synthesis in a whole plant. More specifically, in this preferred embodiment, one or more CYPs or glycosyltransferases may either be engineered to express all or part of the N-terminal extracellular targeting sequence as present in cannabinoid synthase protein, such as THCA synthase or CBDA synthase.

One another embodiment of the invention may include an in vivo method of high-level trichome-targeted cannabinoid biosynthesis, accumulation and/or modification. One preferred embodiment of this in vivo system may include the creation of a recombinant protein that may allow the translocation of a catalase to a site of extracellular cannabinoid synthesis in a whole plant. More specifically, in this preferred embodiment, one or more catalase enzymes may either be engineered to express all or part of the N-terminal extracellular targeting sequence as present in cannabinoid synthase protein, such as THCA synthase or CBDA synthase. In this embodiment, the catalase may be targeted to the site of cannabinoid biosynthesis allowing it to more efficiently neutralize hydrogen peroxide byproducts.

In this preferred embodiment, this N-terminal trichome targeting sequence or domain may generally include the first 28 amino acid residues of a generalized synthase. An exemplary trichome targeting sequence for THCA synthase is identified SEQ ID NO. 40, while trichome targeting sequence for CBDA synthase is identified SEQ ID NO. 41. This extracellular targeting sequence may be recognized by the plant cell and cause the transport of the glycosyltransferase from the cytoplasm to the plant's trichrome, and in particular the storage compartment of the plant trichrome where extracellular cannabinoid glycosylation may occur. More specifically, in this preferred embodiment, one or more glycosyltransferases, such as UDP glycosyltransferase may either be engineered to express all or part of the N-terminal extracellular targeting sequence as present in an exemplary synthase enzyme.

Another embodiment of the invention may include an in vivo method of cytosolic-targeted cannabinoid production, accumulation and/or modification. One preferred embodiment of this in vivo system may include the creation of a recombinant protein that may allow the localization of cannabinoid synthases and/or glycosyltransferases to the cytosol.

More specifically, in this preferred embodiment, one or more cannabinoid synthases may be modified to remove all or part of the N-terminal extracellular targeting sequence. An exemplary trichome targeting sequence for THCA synthase is identified SEQ ID NO. 40, while trichome targeting sequence for CBDA synthase is identified SEQ ID NO. 41. Co-expression with this cytosolic-targeted synthase with a cytosolic-targeted CYP or glycosyltransferase, may allow the localization of cannabinoid synthesis, accumulation and modification to the cytosol. Such cytosolic target enzymes may be co-expressed with catalase, ABC transporter or other genes that may reduce cannabinoid biosynthesis toxicity and or facilitate transport through or out of the cell.

Another embodiment of the invention may include the generation of an expression vector comprising this polynucleotide, namely a cannabinoid synthase N-terminal extracellular targeting sequence and glycosyltransferase genes, operably linked to a promoter. A genetically altered plant or parts thereof and its progeny comprising this polynucleotide operably linked to a promoter, wherein said plant or parts thereof and its progeny produce said chimeric protein, is yet another embodiment. For example, seeds and pollen contain this polynucleotide sequence or a homologue thereof, a genetically altered plant cell comprising this polynucleotide operably linked to a promoter such that said plant cell produces said chimeric protein. Another embodiment comprises a tissue culture comprising a plurality of the genetically altered plant cells.

Another embodiment of the invention provides for a genetically altered plant or cell expressing a chimeric or fusion protein having a cannabinoid synthase N-terminal extracellular targeting sequence (see i.e., SEQ ID: 40-41; see also SEQ ID NO. 42 for full amino acid sequence of THCA synthase) coupled with a UDP glycosyltransferase genes, operably linked to a promoter. Another embodiment provides a method for constructing a genetically altered plant or part thereof having glycosylation of cannabinoids in the extracellular storage compartment of the plant's trichrome compared to a non-genetically altered plant or part thereof, the method comprising the steps of: introducing a polynucleotide encoding the above protein into a plant or part thereof to provide a genetically altered plant or part thereof, wherein said chimeric protein comprising a first domain, a second domain, and wherein said first domain comprises a cannabinoid synthase N-terminal extracellular targeting sequence, and a second domain comprises a glycosyltransferase sequence. These domains may be separated by a third domain or linker. This linker may be any nucleotide sequence that may separate a first domain from a second domain such that the first domain and the second domain can each fold into its appropriate three-dimensional shape and retain its activity.

One preferred embodiment of the invention may include a genetically altered plant or cell expressing a cytosolic-targeted cannabinoid synthase protein having a cannabinoid synthase N-terminal extracellular targeting sequence (SEQ IDs. 40-41) inactivated or removed. In one embodiment, a cytosolic targeted THCA synthase (ctTHCAs) may be identified as SEQ ID NO. 46, while in another embodiment cytosolic targeted CBDA synthase (cytCBDAs) is identified as SEQ ID NO. 22-23). Such cytosolic-targeted cannabinoid synthase protein may be operably linked to a promoter. Another embodiment provides a method for constructing a genetically altered plant or part thereof having glycosylation of cannabinoids in the plant's cytosol compared to a non-genetically altered plant or part thereof, the method comprising the steps of: introducing a polynucleotide encoding the above protein into a plant or part thereof to provide a genetically altered plant or part thereof, wherein said a cannabinoid synthase N-terminal extracellular targeting sequence has been disrupted or removed.

Yet another embodiment of the invention may include an in vivo method of cannabinoid glycosylation in a cannabis cell culture. In one preferred embodiment, to facilitate glycosylation of cannabinoids in cannabis cell culture, which would lack an extracellular trichrome structure, a cannabinoid synthase gene may be genetically modified to remove or disrupt, for example through a directed mutation, the extra-cellular N-terminal targeting domain which may then be used to transform a Cannabis plant cell in a cell culture. In this embodiment, without this targeting domain the cannabinoid synthase, for example THCA or CBDA synthases, may remain within the plant cell, as opposed to being actively transported out of the cell, where it may be expressed with one or more glycosyltransferases, such as UDP glycosyltransferase in the cytoplasm.

Another embodiment of the inventive technology may include systems and methods for enhanced production and/or accumulation of cannabinoid compounds in an in vivo system. In one preferred embodiment, the invention may include the generation of a genetically modified or transgenic Cannabis plant that may produce and/or accumulate one or more cannabinoids at higher than wild-type levels. In one embodiment, a transgenic Cannabis plant may be generated to express one or more Cannabis sativa transcription factors that may enhance the cannabinoid metabolic pathway(s). In one preferred embodiment, a polynucleotide may be generated that encodes for one or more Cannabis sativa myb transcription factors genes, and/or one or more exogenous ortholog genes that enhance the metabolite flux through the cannabinoid biosynthetic pathway.

In this preferred embodiment, a polynucleotide may be generated that encodes for one or more Cannabis sativa myb transcription factors genes, such as CAN833 and/or CAN738 that. As shown in FIG. 32 , these transcriptions factors may drive the production of olivetolic acid, which is a precursor of CBGA, which in turn is a precursor in the biosynthetic pathway of THCs, CBDs and CBC. In an alternative embodiment, a polynucleotide may be generated that encodes for one or more Cannabis sativa myb transcription factors genes orthologs, specifically cannabis Myb12 (SEQ IDs. 11-12), Myb8 (SEQ ID NO. 43), AtMyb12 (SEQ ID NO. 44), and/or MYB112 (SEQ ID NO. 45) that may also drive the production of olivetolic acid, which is a precursor of CBGA, which in turn is a precursor in the biosynthetic pathway of THCs, CBDs and CBC.

In one preferred embodiment, the invention may include methods of generating a polynucleotide that expresses one or more of the SEQ IDs related to enhanced cannabinoid production identified herein. In certain preferred embodiments, the proteins of the invention may be expressed using any of a number of systems to obtain the desired quantities of the protein. Typically, the polynucleotide that encodes the protein or component thereof is placed under the control of a promoter that is functional in the desired host cell. An extremely wide variety of promoters may be available, and can be used in the expression vectors of the invention, depending on the particular application. Ordinarily, the promoter selected depends upon the cell in which the promoter is to be active. Other expression control sequences such as ribosome binding sites, transcription termination sites and the like are also optionally included. Constructs that include one or more of these control sequences are termed “expression cassettes” or “constructs.” Accordingly, the nucleic acids that encode the joined polypeptides are incorporated for high level expression in a desired host cell.

Additional embodiments of the invention may include selecting a genetically altered plant or part thereof that expresses the cannabinoid production transcription factor protein, wherein the expressed protein has increased cannabinoid biosynthesis capabilities. In certain embodiments, a polynucleotide encoding the cannabinoid production transcription factor protein is introduced via transforming said plant with an expression vector comprising said polynucleotide operably linked to a promoter. The cannabinoid production transcription factor protein may comprise a SEQ ID selected from the group consisting of SEQ ID NO: 11-2 or 43-45, or a homologue thereof.

As noted above, one embodiment of the invention may include systems and methods for general and/or localized detoxification of cannabinoid biosynthesis in an in vivo system. In one preferred embodiment, the invention may include the generation of a genetically modified or transgenic Cannabis or other plant that may be configured to be capable of detoxifying hydrogen peroxide by-products resulting from cannabinoid biosynthesis at higher than wild-type levels. In addition, this detoxification may be configured to be localized to the cytosol and/or trichome structure of the Cannabis plant where cannabinoids are actively being synthesized in a whole plant system. In this preferred embodiment of the invention, a transgenic plant, such as a cannabis or tobacco plant or cell, that express one or more genes that may up-regulate hydrogen peroxide detoxification.

In one preferred embodiment, a polynucleotide may be generated that encodes for one or more endogenous and/or exogenous transcription catalase genes, and/or orthologs that catalyze the reduction of hydrogen peroxide:

As such, in one embodiment, the invention comprises the generation of a polynucleotide encoding a exogenous catalase protein that may be expressed within a transformed plant and/or cell culture. In a preferred embodiment, a catalase enzyme configured reduce hydrogen peroxide (H₂O₂) generated during cannabinoid synthesis may be used to transform a cannabis or other plant, such as a tobacco plant. While a number of generic catalase enzymes may be included in this first domain, as merely one exemplary model, a first domain may include an exogenous catalase derived from Arabidopsis (SEQ ID NO. 13-14; see also FIG. 33 ), or Escherichia coli (SEQ ID NO. 15-16), or any appropriate catalase ortholog, protein fragment, or catalases with a homology between about 70%-and approximately 100% as herein defined.

Another embodiment of the current invention may include localization of the catalase enzyme to a trichome structure. As generally outlined above, in this embodiment a trichome targeting sequence from a cannabinoid synthase may be coupled with one or more catalase enzymes in a fusion or chimera—the terms being generally interchangeable in this application. This artificial trichome-target catalase gene may be used to transform a plant having trichome structures, such as Cannabis or tobacco. In a preferred embodiment, a trichome-targeted catalase from Arabidopsis thaliana with a THCA synthase trichome targeting domain is identified as SEQ ID NO. 47, while a trichome-targeted catalase Arabidopsis thaliana with a CBDA synthase trichome targeting domain is identified as SEQ ID NO. 48. In another embodiment, a trichome-targeted catalase from Escherichia coli with a THCA synthase trichome targeting domain is identified as SEQ ID NO. 49, while a trichome-targeted catalase Escherichia coli with a CBDA synthase trichome targeting domain is identified as SEQ ID NO. 50.

Another embodiment of the invention comprises generating a polynucleotide of a nucleic acid sequence encoding the chimeric/fusion catalase protein. Another embodiment includes an expression vector comprising this polynucleotide operably linked to a promoter. A genetically altered plant or parts thereof and its progeny comprising this polynucleotide operably linked to a promoter, wherein said plant or parts thereof and its progeny produce said fusion protein is yet another embodiment. For example, seeds and pollen contain this polynucleotide sequence or a homologue thereof, a genetically altered plant cell comprising this polynucleotide operably linked to a promoter such that said plant cell produces said chimeric protein. Another embodiment comprises a tissue culture comprising a plurality of the genetically altered plant cells.

In a preferred embodiment, a polynucleotide encoding a trichome-targeted fusion protein may be operably linked to a promoter that may be appropriate for protein expression in a Cannabis, tobacco or other plant. Exemplary promotors may include, but not be limited to: a non-constitutive promotor; an inducible promotor, a tissue-preferred promotor; a tissue-specific promotor, a plant-specific promotor, or a constitutive promotor. In a preferred embodiment, one or more select genes may be operably linked to a leaf-specific gene promotor, such as Cab 1. Additional promoters and operable configurations for expression, as well as co-expression of one or more of the selected genes are generally known in the art.

Another embodiment of the invention may provide for a method for constructing a genetically altered plant or part thereof having increased resistance to hydrogen peroxide cytotoxicity generated during cannabinoid synthesis compared to a non-genetically altered plant or part thereof, the method comprising the steps of: introducing a polynucleotide encoding a fusion protein into a plant or part thereof to provide a genetically altered plant or part thereof, wherein said fusion protein comprising a catalase and a trichome-targeting sequence from a cannabinoid synthase.

In one embodiment, the invention may encompass a system to increase overall cannabinoid production and accumulation in trichomes while preventing potential cytotoxicity effects. As generally shown in FIG. 34 , the system may include, in a preferred embodiment, creating a transgenic Cannabis, tobacco or other plant or suspension culture plant that overexpresses at least one Myb transcription factor to increase overall cannabinoid biosynthesis In further preferred embodiments, this transgenic plant may co-express a catalase enzyme to reduce oxidative damage resulting from hydrogen peroxide production associated with cannabinoid synthesis reducing cell toxicity. In certain preferred embodiments, this catalase may be fused with an N-terminal synthase trichome targeting domain, for example from THCA and/or CBDA synthase, helping localize the catalase to the trichome in the case of whole plant systems, and reduce potentially toxic levels of hydrogen peroxide produced by THCA, CBCA and/or CBDA synthase activity.

Another embodiment of the invention may comprise a combination polynucleotide of a nucleic acid sequence encoding a combination of: 1) a cannabinoid production transcription factor protein, such as a myb gene; and/or a catalase protein, or any homologue thereof, which may further include a trichome targeting or localization signal. A genetically altered plant or parts thereof and its progeny comprising this combination polynucleotide operably linked to a promoter, wherein said plant or parts thereof and its progeny produce said protein is yet another embodiment. For example, seeds and pollen contain this polynucleotide sequence or a homologue thereof, a genetically altered plant cell comprising this polynucleotide operably linked to a promoter such that said plant cell produces said proteins. Another embodiment comprises a tissue culture comprising a plurality of the genetically altered plant cells.

Another embodiment of the invention may provide for a method for constructing a genetically altered plant or part thereof having: 1) increased cannabinoid production compared to a non-genetically altered plant or part thereof; and/or and 2) increased resistance to hydrogen peroxide cytotoxicity generated during cannabinoid synthesis compared to a non-genetically altered plant or part thereof, the method comprising the steps of: introducing a combination polynucleotide into a plant or part thereof to provide a genetically altered plant or part thereof.

Additional embodiments of the invention may include selecting a genetically altered plant or part thereof that expresses one or more of the proteins, wherein the expressed protein(s) may have: 1) increased cannabinoid production capabilities, for example through overexpression of an endogenous myb gene; and 2) catalase with/or without a trichome localization capability, or any combination thereof. In certain embodiments, a combination polynucleotide encoding the proteins is introduced via transforming said plant with an expression vector comprising said combination polynucleotide operably linked to a promoter. The cannabinoid production transcription factor protein may comprise a SEQ ID selected from the sequences identified herein, or homologues thereof. Naturally, such combinations and expression combination strategies, such identified in Tables 7-8, 10 below and elsewhere, are exemplary, as multiple combinations of the elements as herein described is included in the invention.

In one preferred embodiment, the inventive technology may include systems, methods and compositions high levels of in vivo cannabinoid hydroxylation, acetylation and/or glycosylation and/or a combination of all three. In a preferred embodiment, the in vivo cannabinoid hydroxylation, acetylation and/or glycosylation and/or a combination of all three may occur in a cannabinoid-producing plant or cell culture system. While in alternative embodiments may include a non-cannabinoid producing plant or cell culture system such as a tobacco plant, like N. benthamiana.

In one embodiment, the invention may include a cannabinoid production, accumulation and modification system. In one preferred embodiment, a plant, such as cannabis or tobacco, may be genetically modified to express one or more heterologous cytochrome P450 genes. In this preferred embodiment, a heterologous human cytochrome P450 (CYP3A4) SEQ ID NO. 1 may be expressed in a cannabinoid-producing plant or cell culture system. While in alternative embodiments a heterologous human cytochrome P450 (CYP3A4) may be expressed non-cannabinoid producing plant or cell culture system such as a tobacco plant, like N. benthamiana. In this embodiment, the overexpression of a heterologous human cytochrome P450 protein, identified as SEQ ID NO. 2, may functionalize endogenously-created cannabinoids so that they can be more efficiently glycosylated and/or acetylated in vivo, rendering them water-soluble.

In an alternative embodiment, the invention may include a cannabinoid production, accumulation and modification system. In one preferred embodiment, a plant, such as cannabis or tobacco, may be genetically modified to express one or more heterologous cytochrome P450 oxidoreductase genes. In this preferred embodiment, a heterologous cytochrome P450 oxidoreductase (oxred) identified as SEQ ID NO. 3, may be expressed in a cannabinoid-producing plant or cell culture system. While in alternative embodiments a heterologous human heterologous cytochrome P450 oxidoreductase (oxred) may be expressed non-cannabinoid producing plant or cell culture system such as a tobacco plant, like N. benthamiana. In this embodiment, the overexpression of a heterologous cytochrome P450 oxidoreductase (oxred) protein, identified as SEQ ID NO. 4, may functionalize endogenously-created cannabinoids so that they can be more efficiently glycosylated and/or acetylated in vivo, rendering them water-soluble.

In one embodiment, the invention may include a cannabinoid production, accumulation and modification system in a non-cannabinoid producing plant. In one preferred embodiment, a plant, such as tobacco, may be genetically modified to express one or more heterologous cytochrome P450 oxidoreductase genes. In this preferred embodiment, a heterologous cytochrome P450 oxidoreductase (oxred) identified as SEQ ID NO. 3 may be expressed in a cannabinoid-producing plant or cell culture system. In alternative embodiments, While in alternative embodiments a heterologous cytochrome P450 oxidoreductase (oxred) may be expressed non-cannabinoid producing plant or cell culture system such as a tobacco plant, like N. benthamiana. In this embodiment, the overexpression of a heterologous cytochrome P450 oxidoreductase (oxred) protein, identified as SEQ ID NO. 4, may help to functionalize cannabinoids introduced to the genetically modified plant or plant cell culture system so that they can be more efficiently glycosylated and/or acetylated, in vivo, rendering them water-soluble.

In a preferred embodiment cytochrome 450 and P450 oxidoreductase are co-expressed.

In another embodiment, the invention may include the expression of one or more exogenous or heterologous, the terms being generally interchangeable, cannabinoid synthase gene in a non-cannabinoid producing plant or plant-cell culture system. In one preferred embodiment, such a gene may include one or more of a CBG, THCA, CBDA or CBCA synthase genes. For example in one embodiment, a Cannabidiolic acid (CBDA) synthase, identified as SEQ ID NO. 5 (gene) or SEQ ID NO. 6 (protein) from Cannabis sativa may use expressed in a non-cannabis-producing plant, such as or plant cell suspension culture of N. benthamiana. In another preferred embodiment, a Tetrahydrocannabinolic acid (THCA) synthase, identified as SEQ ID NO. 42 (gene) from Cannabis sativa may use expressed in a non-cannabis-producing plant, such as a plant cell suspension culture of N. benthamiana.

In another preferred embodiment, such cannabinoid synthase genes expressed in a cannabinoid and/or non-cannabinoid plant or plant-cell suspension culture may be target or localized to certain parts of a cell. For example, in one preferred embodiment, cannabinoid production may be localized to the cytosol allowing cannabinoids to accumulate in the cytoplasm. In one exemplary embodiment, an artificially modified cannabinoids synthase protein may be generated. In this example embodiment, a CBDA synthase may have the trichome targeting sequence remove forming a cytosolic CBDA synthase (cytCBDAs) identified as SEQ ID NO. 22, (gene) or 23 (protein). Alternative embodiments would include generation of other artificial cytosol target synthase genes, such as cytosolic THCA synthase (cytTHCAs) identified as SEQ ID NO. 46 (gene).

These preferred embodiments may be particularly suited for cannabinoid cell-suspension culture cannabinoid expression systems, as such culture systems lack the trichomes present in whole plants. As such, in one preferred embodiment, a cannabinoid producing plant may be transformed to one or more of the artificial cytosolic targeted cannabinoid synthase genes lacking a trichome-targeting signal. In an alternative embodiment, such artificial cytosolic targeted cannabinoid synthase genes may be expressed in a cannabinoid producing plant suspension culture where the corresponding endogenous wild-type synthase gene has been inhibited and/or knocked out.

In one embodiment, the invention may include a cannabinoid production, accumulation and modification system that may generate water-soluble cannabinoids. In one preferred embodiment, a plant, such as cannabis or tobacco, may be genetically modified to express one or more heterologous glycosyltransferase genes, such as UDP glycosyltransferase. In this preferred embodiment, UDP glycosyltransferase (76G1) (SEQ ID NO. 7) (gene)/SEQ ID NO. 8 (protein) from Stevia rebaudiana may be expressed in cannabinoid producing plant or cell suspension culture. In a preferred embodiment, the cannabinoid producing plant or cell suspension culture may be Cannabis. In another embodiment, one or more glycosyltransferase from Nicotiana tabacum and/or a homologous glycosyltransferase from Nicotiana benthamiana, may be expressed in a cannabinoid-producing plant, such as cannabis, or may be over-expressed in an endogenous plant and/or plant cell culture system. In a preferred embodiment, a glycosyltransferase gene and/or protein may be selected from the exemplary plant, such as Nicotiana tabacum Such glycosyltransferase gene and/or protein may include, but not limited to: Glycosyltransferase (NtGT5a) Nicotiana tabacum (SEQ ID NO. 26) (Amino Acid); Glycosyltransferase (NtGT5a) Nicotiana tabacum (SEQ ID NO. 27) (DNA); Glycosyltransferase (NtGT5b) Nicotiana tabacum (SEQ ID NO. 28) (Amino Acid); Glycosyltransferase (NtGT5b) Nicotiana tabacum (SEQ ID NO. 29) (DNA); UDP-glycosyltransferase 73C3 (NtGT4) Nicotiana tabacum (SEQ ID NO. 30) (Amino Acid); UDP-glycosyltransferase 73C3 (NtGT4) Nicotiana tabacum (SEQ ID NO. 31) (DNA); Glycosyltransferase (NtGT1b) Nicotiana tabacum (SEQ ID NO. 32) (Amino Acid); Glycosyltransferase (NtGT1b) Nicotiana tabacum (SEQ ID NO. 33) (DNA); Glycosyltransferase (NtGT1a) Nicotiana tabacum (SEQ ID NO. 34) (Amino Acid); Glycosyltransferase (NtGT1a) Nicotiana tabacum (SEQ ID NO. 35) (DNA); Glycosyltransferase (NtGT3) Nicotiana tabacum (SEQ ID NO. 36) (Amino Acid); Glycosyltransferase (NtGT3) Nicotiana tabacum (SEQ ID NO. 37) (DNA); Glycosyltransferase (NtGT2) Nicotiana tabacum (SEQ ID NO. 38) (Amino Acid); and/or Glycosyltransferase (NtGT2) Nicotiana tabacum (SEQ ID NO. 39) (DNA). The sequences from Nicotiana tabacum are exemplary only as other tobacco Glycosyltransferase may be used.

As noted above, such glycosyltransferases may glycosylate the cannabinoids and/or functionalized cannabinoids in a plant or plant cell suspension culture as generally described here. Naturally, other glycosyltransferase genes from alternative sources may be included in the current invention.

As noted above, in one embodiment, one or more glycosyltransferases may be targeted or localized to a portion of the plant cell. For example, in this preferred embodiment, cannabinoid glycosylation may be localized to the trichome allowing cannabinoids to accumulate at higher-then wild-type levels in that structure. In one exemplary embodiment, an artificially modified glycosyltransferase may be generated. In this example embodiment, a UDP glycosyltransferase (76G1) may be fused with a trichome-targeting sequence at its N-terminal tail. This trichome targeting sequence may be recognized by the cell and cause it to be transported to the trichome. This artificial gene construct is identified as SEQ ID NO. 19 (gene), or SEQ ID NO. 20 (protein). In one embodiment, a trichome targeting sequence or domain may be derived from any number of synthases. For example, in one embodiment a THCA Synthase Trichome domain (SEQ ID NO. 40) may be coupled with a glycosyltransferase as generally described above. Moreover, in another example, a CBDA Synthase Trichome targeting domain (SEQ ID NO. 41) may be coupled with a glycosyltransferase as generally described above.

In one embodiment, the inventive technology may include the in vivo generation of one or more cannabinoid glucuronides. As also noted above, UDP-glucuronosyltransferases catalyze the transfer of the glucuronosyl group from uridine 5′-diphospho-glucuronic acid (UDP-glucuronic acid) to substrate molecules that contain oxygen, nitrogen, sulfur or carboxyl functional groups. Glucuronidation of a compound, such as a cannabinoid may modulate the bioavailability, activity, and clearance rate of a compound. As such, in one embodiment, the invention may include a cannabinoid production, accumulation and modification system that may generate water-soluble cannabinoid glucuronides. In one preferred embodiment, a plant, such as cannabis or tobacco, may be genetically modified to express one or more endogenous and/or heterologous UDP-glucuronosyltransferases. Such a UDP-glucuronosyltransferases may be expressed in cannabinoid producing plant, non-cannabinoid producing plant, or cell suspension culture. Non-limiting examples of UDP-glucuronosyltransferases may include UGT1A1, UGT1A3, UGT1A4, UGT1A6, UGT1A7, UGT1A8, UGT1A9, UGT1A1O, UGT2B4, UGT2B7, UGT2BI5, and UGT2BI7—there nucleotide and amino acid sequences being generally know to those of ordinary skill in the art. These UDP-glucuronosyltransferases may be a recombinant UDP-glucuronosyltransferases. Methods of making, transforming plant cells, and expressing recombinant UDP-glucuronosyltransferases are known in the art. In a preferred embodiment, the cannabinoid producing plant or cell suspension culture may be cannabis. In another embodiment, one or more UDP-glucuronosyltransferases and/or a homolog/ortholog of a UDP-glucuronosyltransferase, may be expressed in a cannabinoid-producing plant, such as cannabis, or may be over-expressed in an endogenous plant and/or plant cell culture system. In a preferred embodiment, a UDP-glucuronosyltransferase may be targeted or localized to a portion of the plant cell. For example, in this preferred embodiment, cannabinoid glucuronidation may be localized to the trichome allowing cannabinoids to accumulate at higher-then wild-type levels in that structure. In one exemplary embodiment, an artificially modified UDP-glucuronosyltransferase may be generated. In this embodiment, a UDP-glucuronosyltransferase may be fused with a trichome-targeting sequence at its N-terminal tail. This trichome targeting sequence may be recognized by the cell and cause it to be transported to the trichome. In one embodiment, a trichome targeting sequence or domain may be derived from any number of synthases. For example, in one embodiment a THCA Synthase trichome domain (SEQ ID NO. 40) may be coupled with a UDP-glucuronosyltransferase as generally described above. Moreover, in another example, a CBDA Synthase trichome targeting domain (SEQ ID NO. 41) may be coupled with a UDP-glucuronosyltransferase as generally described above. In another embodiment, a UDP-glucuronosyltransferase may further be targeted to the cytosol as generally described herein.

In another embodiment, invention may include an embodiment where transiently modified cannabinoids may be passively and/or actively excreted from a cell or into a cell wall. In one exemplary model, an exogenous ATP-binding cassette transporter (ABC transporters or ABCt) or other similar molecular structure may recognize the glycosyl or glucuronic acid or acetyl functional group (conjugate) on the transiently modified cannabinoid and actively transport it across the cell wall/membrane and into the surrounding media.

In one embodiment, a plant may be transformed to express a heterologous ABC transporter. In this embodiment, an ABCt may facilitate cannabinoid transport outside the cells in suspension cultures, such as a cannabis or tobacco cell suspension culture. In this preferred embodiment, a human multi-drug transported (ABCG2) may be expressed in a plant cell suspension culture of the same respectively. ABCG2 is a plasma membrane directed protein and may further be identified as SEQ ID NO. 9 (gene), or 10 (protein).

Generally, a trichome structure, such as in Cannabis or tobacco, will have very little to no substrate for a glycosyltransferase enzyme to use to effectuate glycosylation. To resolve this problem, in one embodiment, the invention may include systems, methods and compositions to increase substrates for glycosyltransferase, namely select sugars in a trichome. In one preferred embodiment, the invention may include the targeted or localization of sugar transport to the trichome. In this preferred embodiment, an exogenous or endogenous UDP-glucose/UDP-galactose transporter (UTR1) may be expressed in a trichome producing plant, such as cannabis or tobacco and the like. In this embodiment, the UDP-glucose/UDP-galactose transporter (UTR1) may be modified to include a plasma-membrane targeting sequence and/or domain. With this targeting domain, the UDP-glucose/UDP-galactose transporter (UTR1) may allow the artificial fusion protein to be anchored to the plasma membrane. In this configuration, sugar substrates from the cytosol may pass through the plasma membrane bound UDP-glucose/UDP-galactose transporter (PM-UTR1) into the trichome. In this embodiment, substrates for glycosyltransferase may be localized to the trichome and allowed to accumulate further allowing enhanced glycosylation of cannabinoids in the trichome. In one example, SEQ ID NO. 21 is identified as the polynucleotide gene sequence for a heterologous UDP-glucose/galactose transporter (UTR1) from Arabidopsis thaliana having a plasma-membrane targeting sequence replacing a tonoplast targeting sequence. The plasma membrane targeting sequence of this exemplary fusion protein may include the following sequence (see SEQ ID NO 21) TGCTCCATAATGAACTTAATGTGTGGGTCTACCTGCGCCGCT, or a sequence having 70-99% homology with the sequence.

It should be noted that a number of combinations and permutations of the genes/proteins described herein may be co-expressed and thereby accomplish one or more of the goals of the current invention. Such combinations are exemplary of preferred embodiments only, and not limiting in any way.

In one embodiment, a gene, such as a cannabinoid synthase, or a gene fragment corresponding with, for example a signal domain may be inhibited, downregulated, disrupted, or may even be knocked-out. One of ordinary skill in the art will recognize the many processes that can accomplish this without undue experimentation. In other embodiment, a knock-out may mean overexpression of an modified endo- or exogenous gene compared to the wt version.

For example, in one embodiment high levels of cannabinoid glycosylation may be generated by co-expressing CYP3A4 and CYP oxidoreductase (cytochrome P450 with P450 oxidoreductase) and at least one endogenous glycosyltransferases in N. benthamiana. In another embodiment, one or more of the endogenous or exogenous gene may be expressed in a plant or plant cell culture with the co-expression of myb and/or a catalase. In this configuration, there exists an additive effect of over-expressing a Myb transcription factor and a catalase, one or more of which may be targeted or localized, in the synthesis of water-soluble cannabinoids (glycosylated and hydroxylated) in Cannabis sativa.

In certain embodiments, endocannabinoids may be functionalized and/or acetylated and/or glycosylated as generally described herein.

All sequences described herein include sequences having between 70-99% homology with the sequence identified

The modified cannabinoids compounds of the present invention are useful for a variety of therapeutic applications. For example, the compounds are useful for treating or alleviating symptoms of diseases and disorders involving CB1 and CB2 receptors, including appetite loss, nausea and vomiting, pain, multiple sclerosis and epilepsy. For example, they may be used to treat pain (i.e. as analgesics) in a variety of applications including but not limited to pain management. In additional embodiments, such modified cannabinoids compounds may be used as an appetite suppressant. Additional embodiment may include administering the modified cannabinoids compounds

By “treating” the present inventors mean that the compound is administered in order to alleviate symptoms of the disease or disorder being treated. Those of skill in the art will recognize that the symptoms of the disease or disorder that is treated may be completely eliminated, or may simply be lessened. Further, the compounds may be administered in combination with other drugs or treatment modalities, such as with chemotherapy or other cancer-fighting drugs.

Implementation may generally involve identifying patients suffering from the indicated disorders and administering the compounds of the present invention in an acceptable form by an appropriate route. The exact dosage to be administered may vary depending on the age, gender, weight and overall health status of the individual patient, as well as the precise etiology of the disease. However, in general, for administration in mammals (e.g. humans), dosages in the range of from about 0.1 to about 30 mg of compound per kg of body weight per 24 hr., and more preferably about 0.1 to about 10 mg of compound per kg of body weight per 24 hr., are effective.

Administration may be oral or parenteral, including intravenously, intramuscularly, subcutaneously, intradermal injection, intraperitoneal injection, etc., or by other routes (e.g. transdermal, sublingual, oral, rectal and buccal delivery, inhalation of an aerosol, etc.). In a preferred embodiment of the invention, the water-soluble cannabinoid analogs are provided orally or intravenously.

In particular, the phenolic esters of the invention (Formula 1) are preferentially administered systemically in order to afford an opportunity for metabolic activation via in vivo cleavage of the ester. In addition, the water soluble compounds with azole moieties at the pentyl side chain (Formula 2, e.g. with imidazole moieties) do not require in vivo activation and may be suitable for direct administration (e.g. site specific injection).

The compounds may be administered in the pure form or in a pharmaceutically acceptable formulation including suitable elixirs, binders, and the like (generally referred to a “carriers”) or as pharmaceutically acceptable salts (e.g. alkali metal salts such as sodium, potassium, calcium or lithium salts, ammonium, etc.) or other complexes. It should be understood that the pharmaceutically acceptable formulations include liquid and solid materials conventionally utilized to prepare both injectable dosage forms and solid dosage forms such as tablets and capsules and aerosolized dosage forms. In addition, the compounds may be formulated with aqueous or oil based vehicles. Water may be used as the carrier for the preparation of compositions (e.g. injectable compositions), which may also include conventional buffers and agents to render the composition isotonic. Other potential additives and other materials (preferably those which are generally regarded as safe [GRAS]) include: colorants; flavorings; surfactants (TWEEN, oleic acid, etc.); solvents, stabilizers, elixirs, and binders or encapsulants (lactose, liposomes, etc). Solid diluents and excipients include lactose, starch, conventional disintegrating agents, coatings and the like. Preservatives such as methyl paraben or benzalkium chloride may also be used. Depending on the formulation, it is expected that the active composition will consist of about 1% to about 99% of the composition and the vehicular “carrier” will constitute about 1% to about 99% of the composition. The pharmaceutical compositions of the present invention may include any suitable pharmaceutically acceptable additives or adjuncts to the extent that they do not hinder or interfere with the therapeutic effect of the active compound.

The administration of the compounds of the present invention may be intermittent, bolus dose, or at a gradual or continuous, constant or controlled rate to a patient. In addition, the time of day and the number of times per day that the pharmaceutical formulation is administered may vary are and best determined by a skilled practitioner such as a physician. Further, the effective dose can vary depending upon factors such as the mode of delivery, gender, age, and other conditions of the patient, as well as the extent or progression of the disease. The compounds may be provided alone, in a mixture containing two or more of the compounds, or in combination with other medications or treatment modalities. The compounds may also be added to blood ex vivo and then be provided to the patient.

Genes encoding by a combination polynucleotide and/or a homologue thereof, may be introduced into a plant, and/or plant cell using several types of transformation approaches developed for the generation of transgenic plants. Standard transformation techniques, such as Ti-plasmid Agrobacterium-mediated transformation, particle bombardment, microinjection, and electroporation may be utilized to construct stably transformed transgenic plants.

As used herein, a “cannabinoid” is a chemical compound (such as cannabinol, THC or cannabidiol) that is found in the plant species Cannabis among others like Echinacea; Acmella oleracea; Helichrysum umbraculigerum; Radula marginata (Liverwort) and Theobroma cacao, and metabolites and synthetic analogues thereof that may or may not have psychoactive properties. Cannabinoids therefore include (without limitation) compounds (such as THC) that have high affinity for the cannabinoid receptor (for example Ki<250 nM), and compounds that do not have significant affinity for the cannabinoid receptor (such as cannabidiol, CBD). Cannabinoids also include compounds that have a characteristic dibenzopyran ring structure (of the type seen in THC) and cannabinoids which do not possess a pyran ring (such as cannabidiol). Hence a partial list of cannabinoids includes THC, CBD, dimethyl heptylpentyl cannabidiol (DMHP-CBD), 6,12-dihydro-6-hydroxy-cannabidiol (described in U.S. Pat. No. 5,227,537, incorporated by reference); (3 S,4R)-7-hydroxy-Δ6-tetrahydrocannabinol homologs and derivatives described in U.S. Pat. No. 4,876,276, incorporated by reference; (+)-4-[4-DMH-2,6-diacetoxy-phenyl]-2-carboxy-6,6-dimethylbicyclo[3.1.1]hept-2-en, and other 4-phenylpinene derivatives disclosed in U.S. Pat. No. 5,434,295, which is incorporated by reference; and cannabidiol (−)(CBD) analogs such as (−)CBD-monomethylether, (−)CBD dimethyl ether; (−)CBD diacetate; (−)3′-acetyl-CBD monoacetate; and ±AF11, all of which are disclosed in Consroe et al., J. Clin. Pharmacol. 21:428S-436S, 1981, which is also incorporated by reference. Many other cannabinoids are similarly disclosed in Agurell et al., Pharmacol. Rev. 38:31-43, 1986, which is also incorporated by reference.

As claimed herein, the term “cannabinoid” may also include different modified forms of a cannabinoid such as a hydroxylated cannabinoid or cannabinoid carboxylic acid. For example, if a glycosyltransferase were to be capable of glycosylating a cannabinoid, it would include the term cannabinoid as defined elsewhere, as well as the aforementioned modified forms. It may further include multiple glycosylation moieties.

Examples of cannabinoids are tetrahydrocannabinol, cannabidiol, cannabigerol, cannabichromene, cannabicyclol, cannabivarin, cannabielsoin, cannabicitran, cannabigerolic acid, cannabigerolic acid monomethylether, cannabigerol monomethylether, cannabigerovarinic acid, cannabigerovarin, cannabichromenic acid, cannabichromevarinic acid, cannabichromevarin, cannabidolic acid, cannabidiol monomethylether, cannabidiol-C4, cannabidivarinic acid, cannabidiorcol, delta-9-tetrahydrocannabinolic acid A, delta-9-tetrahydrocannabinolic acid B, delta-9-tetrahydrocannabinolic acid-C4, delta-9-tetrahydrocannabivarinic acid, delta-9-tetrahydrocannabivarin, delta-9-tetrahydrocannabiorcolic acid, delta-9-tetrahydrocannabiorcol, delta-7-cis-iso-tetrahydrocannabivarin, delta-8-tetrahydrocannabiniolic acid, delta-8-tetrahydrocannabinol, cannabicyclolic acid, cannabicylovarin, cannabielsoic acid A, cannabielsoic acid B, cannabinolic acid, cannabinol methylether, cannabinol-C4, cannabinol-C2, cannabiorcol, 10-ethoxy-9-hydroxy-delta-6a-tetrahydrocannabinol, 8,9-dihydroxy-delta-6a-tetrahydrocannabinol, cannabitriolvarin, ethoxy-cannabitriolvarin, dehydrocannabifuran, cannabifuran, cannabichromanon, cannabicitran, 10-oxo-delta-6a-tetrahydrocannabinol, delta-9-cis-tetrahydrocannabinol, 3, 4, 5, 6-tetrahydro-7-hydroxy-alpha-alpha-2-trimethyl-9-n-propyl-2, 6-methano-2H-1-benzoxocin-5-methanol-cannabiripsol, trihydroxy-delta-9-tetrahydrocannabinol, and cannabinol. Examples of cannabinoids within the context of this disclosure include tetrahydrocannabinol and cannabidiol.

The term “endocannabinoid” refer to compounds including arachidonoyl ethanolamide (anandamide, AEA), 2-arachidonoyl ethanolamide (2-AG), 1-arachidonoyl ethanolamide (1-AG), and docosahexaenoyl ethanolamide (DHEA, synaptamide), oleoyl ethanolamide (OEA), eicsapentaenoyl ethanolamide, prostaglandin ethanolamide, docosahexaenoyl ethanolamide, linolenoyl ethanolamide, 5(Z),8(Z),1 1 (Z)-eicosatrienoic acid ethanolamide (mead acid ethanolamide), heptadecanoul ethanolamide, stearoyl ethanolamide, docosaenoyl ethanolamide, nervonoyl ethanolamide, tricosanoyl ethanolamide, lignoceroyl ethanolamide, myristoyl ethanolamide, pentadecanoyl ethanolamide, palmitoleoyl ethanolamide, docosahexaenoic acid (DHA). Particularly preferred endocannabinoids are AEA, 2-AG, 1-AG, and DHEA.

Hydroxylation is a chemical process that introduces a hydroxyl group (—OH) into an organic compound. Acetylation is a chemical reaction that adds an acetyl chemical group. Glycosylation is the coupling of a glycosyl donor, to a glycosyl acceptor forming a glycoside.

The term “prodrug” refers to a precursor of a biologically active pharmaceutical agent (drug). Prodrugs must undergo a chemical or a metabolic conversion to become a biologically active pharmaceutical agent. A prodrug can be converted ex vivo to the biologically active pharmaceutical agent by chemical transformative processes. In vivo, a prodrug is converted to the biologically active pharmaceutical agent by the action of a metabolic process, an enzymatic process or a degradative process that removes the prodrug moiety to form the biologically active pharmaceutical agent.

As used herein, the term “homologous” with regard to a contiguous nucleic acid sequence, refers to contiguous nucleotide sequences that hybridize under appropriate conditions to the reference nucleic acid sequence. For example, homologous sequences may have from about 70%-100, or more generally 80% to 100% sequence identity, such as about 81%; about 82%; about 83%; about 84%; about 85%; about 86%; about 87%; about 88%; about 89%; about 90%; about 91%; about 92%; about 93%; about 94% about 95%; about 96%; about 97%; about 98%; about 98.5%; about 99%; about 99.5%; and about 100%. The property of substantial homology is closely related to specific hybridization. For example, a nucleic acid molecule is specifically hybridizable when there is a sufficient degree of complementarity to avoid non-specific binding of the nucleic acid to non-target sequences under conditions where specific binding is desired, for example, under stringent hybridization conditions.

The term, “operably linked,” when used in reference to a regulatory sequence and a coding sequence, means that the regulatory sequence affects the expression of the linked coding sequence. “Regulatory sequences,” or “control elements,” refer to nucleotide sequences that influence the timing and level/amount of transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include promoters; translation leader sequences; introns; enhancers; stem-loop structures; repressor binding sequences; termination sequences; polyadenylation recognition sequences; etc. Particular regulatory sequences may be located upstream and/or downstream of a coding sequence operably linked thereto. Also, particular regulatory sequences operably linked to a coding sequence may be located on the associated complementary strand of a double-stranded nucleic acid molecule.

As used herein, the term “promoter” refers to a region of DNA that may be upstream from the start of transcription, and that may be involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. A promoter may be operably linked to a coding sequence for expression in a cell, or a promoter may be operably linked to a nucleotide sequence encoding a signal sequence which may be operably linked to a coding sequence for expression in a cell. A “plant promoter” may be a promoter capable of initiating transcription in plant cells. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, seeds, fibers, xylem vessels, tracheids, or sclerenchyma. Such promoters are referred to as “tissue-preferred.” Promoters which initiate transcription only in certain tissues are referred to as “tissue-specific.”

A “cell type-specific” promoter primarily drives expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An “inducible” promoter may be a promoter which may be under environmental control. Examples of environmental conditions that may initiate transcription by inducible promoters include anaerobic conditions and the presence of light. Tissue-specific, tissue-preferred, cell type specific, and inducible promoters constitute the class of “non-constitutive” promoters. A “constitutive” promoter is a promoter which may be active under most environmental conditions or in most cell or tissue types.

Any inducible promoter can be used in some embodiments of the invention. See Ward et al. (1993) Plant Mol. Biol. 22:361-366. With an inducible promoter, the rate of transcription increases in response to an inducing agent. Exemplary inducible promoters include, but are not limited to: Promoters from the ACEI system that responds to copper; In2 gene from maize that responds to benzenesulfonamide herbicide safeners; Tet repressor from Tn10; and the inducible promoter from a steroid hormone gene, the transcriptional activity of which may be induced by a glucocorticosteroid hormone are general examples (Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:0421).

As used herein, the term “transformation” or “genetically modified” refers to the transfer of one or more nucleic acid molecule(s) into a cell. A plant is “transformed” or “genetically modified” by a nucleic acid molecule transduced into the plant when the nucleic acid molecule becomes stably replicated by the plant. As used herein, the term “transformation” or “genetically modified” encompasses all techniques by which a nucleic acid molecule can be introduced into, such as a plant.

The term “vector” refers to some means by which DNA, RNA, a protein, or polypeptide can be introduced into a host. The polynucleotides, protein, and polypeptide which are to be introduced into a host can be therapeutic or prophylactic in nature; can encode or be an antigen; can be regulatory in nature, etc. There are various types of vectors including virus, plasmid, bacteriophages, cosmids, and bacteria.

As is known in the art, different organisms preferentially utilize different codons for generating polypeptides. Such “codon usage” preferences may be used in the design of nucleic acid molecules encoding the proteins and chimeras of the invention in order to optimize expression in a particular host cell system.

An “expression vector” is nucleic acid capable of replicating in a selected host cell or organism. An expression vector can replicate as an autonomous structure, or alternatively can integrate, in whole or in part, into the host cell chromosomes or the nucleic acids of an organelle, or it is used as a shuttle for delivering foreign DNA to cells, and thus replicate along with the host cell genome. Thus, an expression vector are polynucleotides capable of replicating in a selected host cell, organelle, or organism, e.g., a plasmid, virus, artificial chromosome, nucleic acid fragment, and for which certain genes on the expression vector (including genes of interest) are transcribed and translated into a polypeptide or protein within the cell, organelle or organism; or any suitable construct known in the art, which comprises an “expression cassette.” In contrast, as described in the examples herein, a “cassette” is a polynucleotide containing a section of an expression vector of this invention. The use of the cassettes assists in the assembly of the expression vectors. An expression vector is a replicon, such as plasmid, phage, virus, chimeric virus, or cosmid, and which contains the desired polynucleotide sequence operably linked to the expression control sequence(s).

A polynucleotide sequence is operably linked to an expression control sequence(s) (e.g., a promoter and, optionally, an enhancer) when the expression control sequence controls and regulates the transcription and/or translation of that polynucleotide sequence.

Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), the complementary (or complement) sequence, and the reverse complement sequence, as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (see e.g., Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of the degeneracy of nucleic acid codons, one can use various different polynucleotides to encode identical polypeptides. Table 1a, infra, contains information about which nucleic acid codons encode which amino acids.

TABLE 4 Amino acid Nucleic acid codons Amino Acid Nucleic Acid Codons Ala/A GCT, GCC, GCA, GCG Arg/R CGT, CGC, CGA, CGG, AGA, AGG Asn/N AAT, AAC Asp/D GAT, GAC Cys/C TGT, TGC Gln/Q CAA, CAG Glu/E GAA, GAG Gly/G GGT, GGC, GGA, GGG His/H CAT, CAC Ile/I ATT, ATC, ATA Leu/L TTA, TTG, CTT, CTC, CTA, CTG Lys/K AAA, AAG Met/M ATG Phe/F TTT, TTC Pro/P CCT, CCC, CCA, CCG Ser/S TCT, TCC, TCA, TCG, AGT, AGC Thr/T ACT, ACC, ACA, ACG Trp/W TGG Tyr/Y TAT, TAC Val/V GTT, GTC, GTA, GTG

The term “plant” or “plant system” includes whole plants, plant organs, progeny of whole plants or plant organs, embryos, somatic embryos, embryo-like structures, protocorms, protocorm-like bodies (PLBs), and culture and/or suspensions of plant cells. Plant organs comprise, e.g., shoot vegetative organs/structures (e.g., leaves, stems and tubers), roots, flowers and floral organs/structures (e.g., bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit (the mature ovary), plant tissue (e.g., vascular tissue, ground tissue, and the like) and cells (e.g., guard cells, egg cells, trichomes and the like). The invention may also include Cannabaceae and other Cannabis strains, such as C. sativa generally.

The term “expression,” as used herein, or “expression of a coding sequence” (for example, a gene or a transgene) refers to the process by which the coded information of a nucleic acid transcriptional unit (including, e.g., genomic DNA or cDNA) is converted into an operational, non-operational, or structural part of a cell, often including the synthesis of a protein. Gene expression can be influenced by external signals; for example, exposure of a cell, tissue, or organism to an agent that increases or decreases gene expression. Expression of a gene can also be regulated anywhere in the pathway from DNA to RNA to protein. Regulation of gene expression occurs, for example, through controls acting on transcription, translation, RNA transport and processing, degradation of intermediary molecules such as mRNA, or through activation, inactivation, compartmentalization, or degradation of specific protein molecules after they have been made, or by combinations thereof. Gene expression can be measured at the RNA level or the protein level by any method known in the art, including, without limitation, Northern blot, RT-PCR, Western blot, or in vitro, in situ, or in vivo protein activity assay(s).

The term “nucleic acid” or “nucleic acid molecules” include single- and double-stranded forms of DNA; single-stranded forms of RNA; and double-stranded forms of RNA (dsRNA). The term “nucleotide sequence” or “nucleic acid sequence” refers to both the sense and antisense strands of a nucleic acid as either individual single strands or in the duplex. The term “ribonucleic acid” (RNA) is inclusive of iRNA (inhibitory RNA), dsRNA (double stranded RNA), siRNA (small interfering RNA), mRNA (messenger RNA), miRNA (micro-RNA), hpRNA (hairpin RNA), tRNA (transfer RNA), whether charged or discharged with a corresponding acylated amino acid), and cRNA (complementary RNA). The term “deoxyribonucleic acid” (DNA) is inclusive of cDNA, genomic DNA, and DNA-RNA hybrids. The terms “nucleic acid segment” and “nucleotide sequence segment,” or more generally “segment,” will be understood by those in the art as a functional term that includes both genomic sequences, ribosomal RNA sequences, transfer RNA sequences, messenger RNA sequences, operon sequences, and smaller engineered nucleotide sequences that encoded or may be adapted to encode, peptides, polypeptides, or proteins.

The term “gene” or “sequence” refers to a coding region operably joined to appropriate regulatory sequences capable of regulating the expression of the gene product (e.g., a polypeptide or a functional RNA) in some manner. A gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers, repressors, etc.) preceding (up-stream) and following (down-stream) the coding region (open reading frame, ORF) as well as, where applicable, intervening sequences (i.e., introns) between individual coding regions (i.e., exons). The term “structural gene” as used herein is intended to mean a DNA sequence that is transcribed into mRNA which is then translated into a sequence of amino acids characteristic of a specific polypeptide.

A nucleic acid molecule may include either or both naturally occurring and modified nucleotides linked together by naturally occurring and/or non-naturally occurring nucleotide linkages. Nucleic acid molecules may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications (e.g., uncharged linkages: for example, methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.; charged linkages: for example, phosphorothioates, phosphorodithioates, etc.; pendent moieties: for example, peptides; intercalators: for example, acridine, psoralen, etc.; chelators; alkylators; and modified linkages: for example, alpha anomeric nucleic acids, etc.). The term “nucleic acid molecule” also includes any topological conformation, including single-stranded, double-stranded, partially duplexed, triplexed, hair-pinned, circular, and padlocked conformations.

As used herein with respect to DNA, the term “coding sequence,” “structural nucleotide sequence,” or “structural nucleic acid molecule” refers to a nucleotide sequence that is ultimately translated into a polypeptide, via transcription and mRNA, when placed under the control of appropriate regulatory sequences. With respect to RNA, the term “coding sequence” refers to a nucleotide sequence that is translated into a peptide, polypeptide, or protein. The boundaries of a coding sequence are determined by a translation start codon at the 5′-terminus and a translation stop codon at the 3′-terminus. Coding sequences include, but are not limited to: genomic DNA; cDNA; EST; and recombinant nucleotide sequences.

The term “sequence identity” or “identity,” as used herein in the context of two nucleic acid or polypeptide sequences, refers to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window.

The term “recombinant” when used with reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, organism, nucleic acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid or protein, or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells may express genes that are not found within the native (nonrecombinant or wild-type) form of the cell or express native genes that are otherwise abnormally expressed—over-expressed, under expressed or not expressed at all.

The terms “approximately” and “about” refer to a quantity, level, value or amount that varies by as much as 30%, or in another embodiment by as much as 20%, and in a third embodiment by as much as 10% to a reference quantity, level, value or amount. As used herein, the singular form “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise.

As used herein, “heterologous” or “exogenous” in reference to a nucleic acid is a nucleic acid that originates from a foreign species, or is synthetically designed, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. A heterologous protein may originate from a foreign species or, if from the same species, is substantially modified from its original form by deliberate human intervention. By “host cell” is meant a cell which contains an introduced nucleic acid construct and supports the replication and/or expression of the construct. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as fungi, yeast, insect, amphibian, nematode, or mammalian cells. Alternatively, the host cells are monocotyledonous or dicotyledonous plant cells. An example of a monocotyledonous host cell is a maize host cell

EXAMPLES Example 1: Functionalization of Cannabinoids by Cytochrome P450s

The present inventors have demonstrated that cannabinoids can be functionalized in an in vivo plant system. Specifically, the present inventors utilized cytochrome P450 monooxygenases (CYP) to modify or functionalize the chemical structure of cannabinoids. As shown below, CYPs do this by inserting an oxygen atom into hydrophobic molecules to make them more reactive and hydrophilic. A representative reaction may include the generalized reaction in FIG. 13 .

The P450 enzyme system involves several cytochrome P450 species and nonspecific cytochrome P450 oxidoreductases. As shown in FIG. 5 , the present inventors used a human cytochrome P450 (CYP3A4) in a double construct with an exemplary human cytochrome P450 oxidoreductase, both expressed under the control of the constitutive CaMV 35S promoter with 5′ untranslated regions to enhance translation. Protein and DNA sequences for the functionalization of cannabinoids (CYP3A4 and P450 oxidoreductase) are identified as SEQ ID NO's. 1-4. Expression was confirmed using RT-PCR utilizing the forward and reverse primers identified in Table 3 below. As noted above, the present inventors demonstrated that overexpressing of P450s generated functionalized cannabinoids which could then be glycosylated, rendering them water-soluble.

Example 2: P450 Overexpression Enhances In Vivo Hydroxylation and Glycosylation of Cannabinoids in Plant Systems

The present inventors have demonstrated that overexpression enhanced in vivo hydroxylation and glycosylation of CBDA in an exemplary plant system. Specifically, as generally shown in FIG. 6 , the present inventors demonstrate that infiltration of tobacco leaves with Agrobacterium carrying CYP3A4 and P450 oxidoreductase was accomplished as described in herein. Confirmation of expression was done using RT-PCR 2-3 days after infiltration (FIG. 6 ).

As generally shown in FIG. 7 , the present inventors demonstrate that overexpression of the CYP3A4+P450 oxidoreductase construct and subsequent feeding of at least one cannabinoid, in this case CBDA, upon confirmation of expression resulted in in vivo glycosylation of CBDA in tobacco leaves (FIG. 7 ). On average, glycosylation increased 3-fold in transgenic N. benthamiana plants compared to the control while hydroxylation increased up to 13-fold. As such, in certain embodiment, tobacco glycosyltransferases may be utilized as key targets in the current inventive technology for glycosylation of cannabinoids.

Example 3: Identification of Modified Water-Soluble Cannabinoids by Mass Spectrometry

The present inventors demonstrated the biosynthesis of modified functionalized as well as water-soluble cannabinoids in both in vitro as well as in vivo plant system. Specifically, the present inventors identified the cannabinoid biotransformations associated with the gene constructs in both in vitro assays and transient leaf expression. Through the use of accurate mass spectrometry measurements, the present inventors were able to identify and confirm the biosynthesis of modified water-soluble cannabinoids.

Specifically, as generally shown in FIGS. 1-4 , the present inventors were able to identify the glycosylated water-soluble cannabinoids in the chromatographic analysis and were able to produce extracted ion chromatograms for peak integration. For example, FIG. 1 panel B, illustrates the identification of multiple constitutional cannabinoid isomers of a single glycoside moiety, while in FIG. 2 panel B, an example of multiple constitutional isomers of the cytochrome P450 oxidation are illustrated. Peak areas for each identified molecule were used for relative quantification between treatments. Based on these results we confirmed biosynthesis of modified cannabinoid molecules containing up to two glycosides moieties, 0 acetyl glycoside, as well as hydroxylation (R—OH) biotransformations.

Tables 1 and 2 are provided below further demonstrating the production of the select modified cannabinoid molecules. Generally referring to Tables 1-2 below, the present inventors demonstrated that based on the reduced retention time in the water: acetonitrile HPLC gradient, the glycosylated and hydroxylated cannabinoids, which eluted earlier than their non-modified forms, are demonstrated to be more water soluble than their non-modified forms.

Example 4: Generation of Heterologous Cytosolic Synthesis and Glycosylation Gene Constructs for Expressions in Tobacco Leaves and Cell Suspensions

As shown in FIG. 8 , the present inventors generated a triple gene construct for expression of cannabidiolic acid (CBDA) synthase in which the trichome targeting sequence had been removed, and the glycosyltransferase 76G1 from Stevia rebaudiana. In this construct the multi-drug ABC transporter ABCG2 was also included.

In one embodiment of the present inventive technology, the gene construct may be used to transform a plant cell that may further be configured to be cultured in a suspension culture. In one preferred embodiment, a cannabis cell may be transformed with the construct generally outline in FIG. 8 . In this preferred embodiment, cannabinoids produced by the cannabis cells in the cell culture may be functionalize through the overexpression of the CYP3A4+P450 oxidoreductase as described above, and further glycosylated by the expression and action of the heterologous UDP glycosyltransferase (76G1) from Stevia rebaudiana referend above. Moreover, as generally outline herein, the cannabinoids may be modified so as to be functionalized and/or glycosylated, or generally water-soluble, and may then be secreted into the cell wall area, in the case of a whole plant, or the surrounding media in suspension cultures, with the aid of the ABC transporter. In one embodiment, this construct may be used for synthesis and modification of cannabinoids in cell suspension cultures, utilizing tobacco bright yellow cells or cannabis cells.

As generally shown in FIG. 9 , in vivo expression of CBDA synthase, UDP glycosyltransferase 76G1 and ABCG2 was confirmed. Reverse and forward primers used in the RT-PCR reactions are provided below in Table 4 below.

The gene and protein sequence identifications for CBDA synthase are provided as SEQ ID NO's 5 and 6 respectively. It should be noted that a variety of cannabinoid synthase genes/proteins may be used with the current inventive technology, CBDA synthase being exemplary only. Indeed, it is specifically contemplated that the synthase enzyme associated with any of the cannabinoids identified herein may be incorporated into the current invention without undue experimentation. In one embodiment, one or more of such exogenous or endogenous synthase enzyme may further have the trichome targeting sequence excised, again, a step that can be readily accomplished without undue experimentation. Example may THCA synthase, CBG synthase, THCA synthase, CBDA synthase or CBCA synthase, which may in this embodiment have their trichome targeting sequence had been removed.

The gene and protein sequence identifications for glycosyltransferase 76G1 from Stevia rebaudiana are provided as SEQ ID NO's. 7, and 8 respectively. The gene and protein sequence identifications for the multi-drug ABC transporter ABCG2 are provided as SEQ ID NO's 9 and respectively.

Example 5: In Vivo Cytosolic Synthesis and Glycosylation of Cannabinoids in N. benthamiana Leaves and Cell Suspensions

As shown in FIG. 10 , the present inventors demonstrate that in plants, in this embodiment N. benthamiana, expressing the above referenced cytosolic construct, glycosylation of CBGA occurred as well as formation of modified or hydroxylated CBDA. The glycosylation of CBGA evidences in vivo glycosylation of cannabinoids by overexpressing a glycosyltransferase in N. benthamiana plants. The presence of glycosylated cannabinoids in wild type plants suggests the presence of a strong glycosyltransferase in tobacco. As such, in one embodiment, over expression of a heterologous or homologous tobacco glycosyltransferase may expressed or overexpressed resulting in the enhanced in vivo biosynthesis of water-soluble cannabinoids in whole plants, as well as in suspension cultures. For example, in one embodiment, a heterologous tobacco glycosyltransferase may be expressed in a cannabis plant or cell culture resulting in the in vivo biosynthesis of water-soluble cannabinoids in the Cannabis plant and/or a Cannabis suspension cultures.

Example 6: Water Soluble Cannabinoid Production Systems Utilizing MYB Transcription Factor and/or Catalase

The present inventors have developed a plurality of systems for the biosynthesis and modification of cannabinoids based on cellular location using novel methods of protein targeting. As shown in Table 10, the present inventors designed such novel systems and methods to enhance production and modification (glycosylation, acetylation and functionalization) of cannabinoids as well as to mitigate toxicity resulting from cannabinoid accumulation. Certain embodiments, included the expression of a MYB transcription factor and a catalase (FIG. 27 ) to degrade hydrogen peroxide resulting from CBDA synthase activity. In one preferred embodiment, the present inventors used Arabidopsis thaliana or an E. coli catalase gene and a predicted Cannabis MYB transcription factor involved in elevating genes involved in cannabinoid biosynthesis. DNA and protein sequences for Cannabis predicted MYB transcription factor (SEQ ID NOs. 11-12, DNA and amino acid sequences respectively), Arabidopsis thaliana catalase SEQ ID NOs. 13-14, DNA and amino acid sequences respectively) and/or E. coli catalase (SEQ ID NO. 15-16, DNA and amino acid sequences).

Example 7: Enhanced In Vivo Cytosolic Synthesis and Glycosylation of Cannabinoids in Tobacco Leaves and Cell Suspensions

The present inventors have demonstrated the enhanced in vivo modification of cannabinoids in transgenic plants co-infected with constructs for glycosylation, P450-mediated functionalization (hydroxylation) and detoxification of hydrogen peroxide by catalase. As further shown in FIG. 11 , functionalization and glycosylation, mainly of the substrate CBGA was observed in transgenic tobacco plants overexpressing CBDA synthase, UDP glycosyltransferase and ABC transporter but increased when overexpression of this construct was coupled with cytochrome P450, MYB transcription factor and catalase. As previously noted, overexpression of a cytochrome P450 enhanced glycosylation of cannabinoids. As such, the present inventor demonstrated the formation and glycosylation of CBDA in vivo in transiently transformed tobacco leaves fed with the precursor CBGA.

The present inventors also compared the activities of endogenous and transgenic glycosyltransferase activities in tobacco. Specifically, as shown in FIG. 12 , the present inventor performed in vitro assays of UDP glycosyltransferase and CBDA synthase. Short assays of 3 hours at 30° C. did not reveal any difference in glycosylation of CBGA between the wild type and transgenic N. benthamiana plants, suggesting endogenous glycosylation. In extended assays (14 hours), there was a significant difference in the detection of glycosylated CBGA in transgenic plants compared to the wild type demonstrating increased glycosylation activity in transgenic plants.

In certain embodiment, glycosyltransferases from tobacco, or other plants may be used as herein described. In one embodiment, one or more heterologous or homologous glycosyltransferases may be expressed or over expressed in a plant, such as tobacco or Cannabis. Gene and protein sequences for exemplary glycosyltransferases are identified below in Table 9.

Example 8: Generation of Trichome-Targeted Cannabinoid Synthesis and Glycosylation Constructs of Cannabidiolic Acid (CBDA)

As shown in FIGS. 14-15 , the present inventors demonstrated a system of trichome-targeted synthesis and synthesis and glycosylation of cannabinoid compounds, such as CBDA. By targeting CBDA synthase, a UDP-glucose/UDP-galactose transporter (PM-UTR1) targeted to the plasma, and a Stevia UDP-glycosyltransferase 76G1 (tsUGT) to the trichomes, these genes may produce and accumulate, in this case CBDA and its glycosylated derivatives (primary, secondary glycoside), as well as novel CBDA derivatives, in the trichomes.

SEQ ID NO. 17 is identified as the polynucleotide gene sequence for a CBDA synthase having a trichome targeting sequence. SEQ ID NO. 18 is identified as the corresponding protein sequence for a CBDA synthase having a trichome targeting domain.

SEQ ID NO. 19 is identified as the polynucleotide gene sequence for a trichome-targeted UDP-glycosyltransferase (76G1) coding sequence, in this instance being optimized for Arabidopsis thaliana expression, although other codon optimized versions fall within the scope of this invention. SEQ ID NO. 20 is identified as the corresponding protein sequence for a UDP-glycosyltransferase (76G1) having a trichome targeting domain.

SEQ ID NO. 21 is identified as the polynucleotide gene sequence for a UDP-glucose/galactose transporter (UTR1) having a plasma-membrane targeting sequence.

Example 9: Trichome-Targeted Synthesis and Glycosylation of Cannabidiolic Acid (CBDA)

As shown in FIGS. 16-17 , gene expression of CBDA synthase, tsUGT and PM-UTR1 in N. benthamiana infiltrated leaves was confirmed 2 DPI (Days Post Infiltration of Agrobacterium Ti-plasmid constructs) via RT-PCR (FIGS. 19 and 20 ). As expected, CBGA substrate was detected in all infiltrated leaves and wild type control (no Agrobacterium infiltration). CBGA primary and secondary glycosides were also detected in all infiltrated leaves and wild-type control, further demonstrating an endogenous glycosyltransferase activity acting upon CBGA. Moreover, CBGA acetylated primary glycoside was detected in all samples, including WT control, providing evidence of endogenous acetylation. CBDA was detected at marginal levels in samples infiltrated with both trichome and cell suspension constructs, but not in wild type plants.

Example 10: Cytosolic-Targeted Synthesis and Glycosylation of Cannabidiolic Acid (CBDA)

The present inventors have demonstrated a system of cytosolic-targeted cannabinoid synthesis and glycosylation. By targeting or localizing, CBDA synthase (CBDAs) and UDP-glycosyltransferase 76G1 (UGT) to the cytosol, the present inventors demonstrated that plants expressing these heterologous genes produce and accumulate, in this embodiment, CBDA and its glycosylated derivatives (primary, secondary glycoside), as well as other CBDA derivatives, in the cytosol. As shown in FIG. 18 , a gene expression vector for the cytosolic cannabinoid production system was generated. This construct included a cauliflower mosaic 35S promoter; AtADH 5′-UTR, enhancer element; cytCBDAs, cannabidiolic acid synthase with the trichome target sequence removed; HSP terminator; cytUGT76G1, UDP glycosyltransferase from Stevia rebaudiana.

SEQ ID NO. 22 is identified as the polynucleotide gene sequence for a, cannabidiolic acid synthase with the trichome target sequence removed (cytCBDAs). SEQ ID NO. 23 is identified as the corresponding protein sequence of cytCBDAs.

SEQ ID NO. 24 is identified as the polynucleotide gene sequence for a, Cytosolic-targeted UDP-glycosyltransferase (UGT76G1) coding sequence (optimized for Arabidopsis thaliana expression) (cytUGT76G1 or cytUTG). SEQ ID NO. 25 is identified as the corresponding protein sequence of cytUGT76G1 or cytUTG.

As an exemplary plant model, N. benthamiana plants were grown from seed and after 4 weeks of vegetative growth, leaves were co-infiltrated with Agrobacterium tumefaciens GV3101 carrying the following constructs: Cytosolic CBDAs+Cytosolic UGT in pRI201-AN or cell suspension construct, Myb/catalase in pRI201-AN, and p19 silencing suppressor in pDGB3alpha2. Agrobacterium density was normalized to 2 at absorbance of 600 nm using a spectrophotometer and cultures co-infiltrated in same ratio (1:1:1). After 2 and 4 days post-Agrobacterium infiltration (DPI), 1 mL CBGA (2.7 mM) dissolved in 0.1% Tween 20 (Sigma-Aldrich) or 0.1% Triton X-100 (Sigma-Aldrich) was infiltrated to each leaf. In a second embodiment using the cytosolic construct, 4 mM UDP-glucose was added to the CBGA media before feeding. Three biological replicates were used. RT-PCR primers are outlined in Table 5 below.

As shown in FIGS. 19-20 , gene expression of cytCBDAs and cytUGT was confirmed via RT-PCR after 1 and 2 DPI. No expression of ABC transporter (ABCt) was observed after 1 DPI in leaves infiltrated cells suspension construct. This does not impact this experiment as the role of ABCt was to facilitate cannabinoid transport outside the cells in suspension cultures. As shown in FIG. 21 , CBGA and its glycosylated and acylated derivatives were detected in concentrations higher than in the trichome construct infiltrated leaves, except for secondary glycosides. Moreover, CBDA was detected in higher concentrations (up to 34 ppm) in leaves infiltrated with the cell suspension construct, compared to the trichome construct experiments (up to 2.6 ppm). As shown in FIG. 22 , when UDP-glucose 4 mM (substrate for UGT) was provided together with CBGA (substrate for CBDAs), the present inventors detected low levels of glycosylated and hydroxylated CBDA in leaves infiltrated with both the cytosolic and cell suspension construct, but not in the WT control. This result demonstrates the novel in plant synthesis, glycosylation and hydroxylation of CBDA in the surrogate plant N. benthamiana, as demonstrated by the Extracted Ion Chromatograms shown in FIG. 23 .

Example 11: Hydroxylation and Glycosylation of Cannabinoids in Cannabis sativa

The present inventors demonstrate the glycosylation and hydroxylation of cannabinoids in Cannabis sativa. To further confirm our findings using N. benthamiana as a plant model, we performed Agrobacterium infiltration of the same plasmid constructs described in the section above in various strains of Cannabis sativa (see FIG. 24 Sample IDs). As shown in FIGS. 24-26 , expression of the select genetic constructs in C. sativa, as in N. benthamiana, demonstrate synthesis and accumulation of hydroxylated and/or glycosylated cannabinoids, in this case CBDA. A comparison of the results using different Agrobacterium genetic constructs is presented in Table 8 below.

As the present inventors have demonstrated, in one embodiment, where the cytosolic construct was co-transformed with the Myb/catalase (MYBCAT) expression vector, yielded the highest detection of CBDA and CBDA glycoside, demonstrating the role of these genes in mitigating toxicity effects due to hydrogen peroxide accumulation (catalase) and overall increase in cannabinoid synthesis (Myb transcription factor).

Materials and Methods Example 12: Use of a Tobacco as an Exemplary Plant System for the In Vivo Functionalization and Glycosylation of Cannabinoids

The present inventors demonstrated the in vivo functionalization and glycosylation of cannabinoids in a model plant system. Specifically, the present inventors used N. benthamiana (tobacco) as a model system to demonstrate in vivo functionalization and glycosylation of cannabinoids. In this embodiment, transient transformation through Agrobacterium infiltration was performed in N. benthamiana. The present inventors demonstrated expression of heterologous genes that were expressed in transformed N. benthamiana using a number of heterologous gene expression vectors (described below). In this exemplary embodiment, upon confirmation of expression of the heterologous genes that would functionalize and glycosylate cannabinoid molecules, the present inventors introduced to the plants select cannabinoid compounds. In this embodiment, the present inventors introduced to the transgenic N. benthamiana plants cannabigerolic acid (CBGA) and/or cannabidiolic acid (CBDA). The present inventors also demonstrated the in vivo functionalization and glycosylation of cannabinoids in a cell suspension culture. Specifically, the inventors used exemplary tobacco bright yellow (BY2) cells as a cell suspension system for studies of cannabinoid production, functionalization and/or glycosylation.

Example 13: Transient Transformation of the Exemplary Plant Model Nicotiana benthamiana

The present inventors used Agrobacterium tumefaciens Ti-plasmid-mediated transformation with the plant expression vector pRI201-AN (Takara Bio USA), a binary vector for high-level expression of a foreign gene in dicotyledonous plants carrying the constitutive 35S promoter and an Arabidopsis thaliana Alcohol dehydrogenase (AtAdh) as a translational enhancer (Matsui et al. 2012). N. benthamiana was transiently transformed according to the method described by Sparkes et al. 2006. Overnight cultures of Agrobacterium strain GV3101 were transferred to a 250 mL flask with 50 mL LB medium supplemented with 50 mg/L of Kanamycin, 50 mg/L of Gentamycin and 10 mg/L of Rifampicin and grown for 4-8 hours until the optical density at 600 nm (OD600) reached approximately between 0.75 and 1. The cells were pelleted in a centrifuge at room temperature and resuspended in 45 mL of infiltration medium containing 5 g/L D-glucose, 10 mM MES, 10 mM MgCl2 and 100 μM acetosyringone. 1 ml of the solution was used to infiltrate the leaves using a 1 mL syringe. Expression of the transgene(s) was confirmed 2-4 days after infiltration by RT-PCR. For RT-PCR analysis, 100 mg of leaf tissue were frozen in liquid nitrogen and ground in a TissueLyser (QIAGEN Inc, USA). RNA was extracted following the EZNA plant RNA extraction kit (Omega Bio-tek Inc, USA). Up to a microgram of total RNA was used to synthesize cDNA using the superscript III cDNA synthesis kit (Thermo Fisher Scientific, USA). The cDNA was used to check for the expression of transgene(s) by RT-PCR.

Example 14: Introduction of Select Cannabinoid Substrate(s) to the Transgenic N. benthamiana Strain

Select enzyme substrates were introduced to the transgenic or genetically modified N. benthamiana strain two days after Agrobacterium infiltration and upon confirmation of transgene expression by RT-PCR. In this example, approximately 277 μM cannabigerolic acid (CBGA) and/or cannabidiolic acid (CBDA) was dissolved in 1 mL of buffer containing 10 mM MES, 10 mM MgCl2 and 0.1% Triton X100 or 0.1% Tween20 and applied to the transformed leaves either by infiltration or by dabbing with a cotton applicator. Plants were harvested after 1-4 days, weighed for fresh weight and frozen at −80° C. before conducting LC-MS analysis for the presence of modified cannabinoids.

Example 15: In Vitro Assays for CBDA Synthase and Glycosyltransferase Activity

CBDA synthase is generally active in the pH range 4-6 (Taura et al. 1996) while glycosyltransferases are typically active in the pH range 5.0 to 7.0 (Rini and Esko, 2017). Based on this difference in optimal pH for enzyme activity, the present inventors generated a single extraction buffer for a combined assay of CBDA synthase and UDP glycosyltransferase at pH 6 and 30° C. in in vitro assays (Priest et al., 2006). The present inventors ground the transformed leaf tissue in liquid nitrogen. A grinding buffer was added consisting of 50 mM MES, pH 6, 1 mM EDTA, 5 mM β-mercaptoethanol and 0.1% Triton X-100 was added at 5:1 ratio of buffer to fresh weight of plant using a mortar and pestle. The extract was filtered on ice through 2 layers of cheesecloth to remove debris and centrifuged at 21000 g for 5 minutes at 4° C. The supernatant was used in subsequent assays. Protein concentration of the supernatant was quantified by the Bradford assay, using bovine serum albumin as the standard. To start the reaction, 100-200 μg of crude total protein was used. The assay was carried out with and without UDP-glucose to check if glycosylation of cannabinoid substrate was preventing downstream reactions or transport of CBGA. Wild type plants were used as controls to separate endogenous from overexpressed UDP glycosyltransferase activity. The reaction was started by adding 100 μg of protein, and 8 mM uridine diphosphate glucose (UDPG) as the sugar-nucleotide donor to a reaction mixture consisting of approximately 277 μM CBGA, 0.1% (w/v) Triton X-100, 3 mM MgCl2 and 50 mM MES (pH 6.0). The reaction was incubated at 30° C. for 3 h or overnight for 14 hours. The reaction was terminated by freezing in liquid nitrogen and the samples were stored at −80° C. before LC-MS analysis.

Example 16: Trichome-Targeted Synthesis and Glycosylation

As an exemplary plant model, N. benthamiana plants were grown from seed and, after 4 weeks of vegetative growth, the leaves were co-infiltrated with Agrobacterium tumefaciens GV3101 carrying the following constructs: Trichome CBDAs+trichome UGT in pRI201-AN (trichome construct), PM-UTR1 in pRI201-AN, and p19 silencing suppressor in pDGB3alpha2. In a second experiment, leaves were also infiltrated with the Agrobacterium expressing a Ti-plasmid with the Myb/catalase genes. Agrobacterium density was normalized to 1 or 2 at absorbance of 600 nm using a spectrophotometer and cultures co-infiltrated in same ratio (1:1:1). After 1 and 4 days post Agrobacterium infiltration (DPI), 1 mL CBGA (277 μM) dissolved in 0.1% Tween20 (Sigma-Aldrich) or 3% DMSO (Sigma-Aldrich) was infiltrated to each leaf. Three biological replicates were used. The experiment was repeated twice. After preliminary results, Agrobacterium densities of 2 at OD₆₀₀ were selected for all following infiltration experiments. Moreover, 0.1% Tween20 was chosen over DMSO 3% due to better solubilizing CBGA substrate.

In this embodiment, leaf samples were collected at 2 DPI and immediately frozen in liquid nitrogen. RNA extraction was done using RNA plant mini-kit as described by manufacturer (Qiagen). cDNA was synthesized using RNA to cDNA Ecodry Premix as described by manufacturer (Takara). Template cDNA was normalized to 50 ng of corresponding total RNA per reaction. Annealing temperature in Celsius: 60. Extension time: 15s. 35 cycles. Q5 DNA polymerase kit used as described by manufacturer (New England Biolabs). RT-PCR primers are outlined in Table 5 below.

Example 17: Transient Transformation of Cannabis sativa

The present inventors performed Agrobacterium tumefaciens-mediated transient transformation of Cannabis sativa. The experimental groups consisted of young leaves of high CBD variety (˜10% in dried flowers) and trichome leaves of high THC variety (˜20% dried flowers).

To transform leaves of high CBD varieties, the present inventors germinated 100 seeds three times; this was done to ensure that a sufficient number of plants would be available for all 9 independent transformation events. To transform trichome leaves, the present inventors used small trichome-containing leaves of several varieties known to be high THC varieties. Experimental set up consisted of 2 different Agrobacterium tumefaciens strains. For transient transformation of Agrobacterium strain EHA 105, the present inventors grew cells in 10 ml of LB medium supplemented with 100 mg/L of Rifampicin and 50 mg/L of Kanamycin and for Agrobacterium strain GV3101::6000 cells were grown with 50 mg/L of Kanamycin, 25 mg/L of Gentamycin and 50 mg/L of Rifampicin. A single Agrobacterium colony was used for inoculation and grown overnight. Then, 1 ml of this culture was inoculated into 500 ml of aforementioned LB medium supplemented with 20 μM acetosyringone. Agrobacteria were grown to OD₆₀₀ of approximately between 1 and 1.5. The cells were pelleted in a centrifuge at room temperature and resuspended in infiltration medium containing 10 mM MES, 10 mM MgCl2 and 200 μM acetosyringone to an OD₆₀₀ of 0.5.

Bacterial culture was then used for three different types of Cannabis sativa transformations. In all cases, transformation was done in the form of co-transformation, mixing all relevant strains (plasmids) in equal proportion of cell numbers. First, for the present inventors infiltrated young (two weeks old) fully expended Cannabis sativa plants using 1 ml syringe. Prior to transformation, plants were kept under plastic cover, to ensure maximum softness of the leaves. Infiltration was performed from abaxial side, ensuring that the entire surface of the leaf is infiltrated at 12/h/12 h day/night at 22° C.

Second, the present inventors vacuum infiltrated detached young (two weeks old) fully expended Cannabis sativa leaves. Prior to transformation, plants were kept under plastic cover, to ensure maximum softness of the leaves. Leaves were then placed on half-strength Murashige and Skoog (1962) (½ MS) agar supplemented with 61.8 mM ammonium nitrate and incubated for 5 days at 12/h/12 h day/night at 22° C.

Third, trichome leaves were detached, placed into 50 ml Falcon tubes and vacuum infiltrated with aforementioned bacterial solution 2× for 10 min each. Leaves were then placed on ½ MS agar supplemented with 61.8 mM ammonium nitrate and incubated for 5 days.

All experiments were done in triplicates, with the fourth replicate done for collection of DNA/RNA and staining X-gluc for measuring the activity of beta-glucuronidase (GUS) after co-infiltration with Agrobacterium-containing GUS gene. In all cases, leaves were harvested after 5 days of transformation, frozen in liquid nitrogen and stored at −80° C.

Example 18: Extraction of Water-Soluble Cannabinoids from N. benthamiana

Fresh transformed plant material was harvested from greenhouse experiments in 15 or 50 mL polypropylene centrifuge tubes and flash frozen in liquid N2. The frozen plant material was enzymatically quenched by submersing the plant material in boiling methanol for 2 min. The methanol-quenched material was homogenised using a P-10-35 homogenizer (Kinematica, Bohemia N.Y.). The homogenate was extracted by brief agitation in a final volume of 10 mL or 30 mL 70% methanol (v/v) respective to tube size. The resulting extracts were clarified by centrifugation at 2,500 rpm at 4° C. for 15 minutes in a Beckman J-6B floor centrifuge (Beckman Coulter, Indianapolis Ind.). The supernatant was transferred into a polypropylene tube and evaporated under a stream of N2 at 45° C. until dried. The extracts were reconstituted in methanol containing 20 μg/mL of the internal standard 7-Hydroxyoumarin (Sigma-Aldrich, H24003). The reconstituted extracts were placed into 1.5 mL microfuge tubes and clarified in a microcentrifuge at 10,000 g for 15 min. 500 μL of the supernatant was transferred to a 2 mL auto sampler vial and kept at 4° C. until analysis. In vitro assays sample preparation: samples were syringed filtered through 0.45 μm PVDF membrane into a 2 mL auto sampler vial.

Example 19: Extraction of Water-Soluble Cannabinoids from Cannabis sativa

Fresh plant material was harvested from plants grown in chamber in 1.5 mL polypropylene centrifuge tubes and flash frozen in liquid N2. The frozen plant material was homogenized using pestle and mortar and enzymatically quenched by submersing the plant material in boiling 100% ethanol for 2 min. Homogenized solution was diluted to 70% ethanol. The resulting extracts were clarified by centrifugation at 2,500 rpm at 4° C. for 15 minutes in Eppendorf centrifuge (Centrifuge 5415 R). The supernatant was transferred into a polypropylene tube and concentrated three times using vacuum centrifuge (Speedvac SC110, Savant). 2 μl of 20 μg/mL of the internal standard Umbelliferone (Sigma-Aldrich, H24003) was added to 98 μl of concentrated extract and taken for analysis.

Example 20: Liquid Chromatography Mass Spectrometry Used to Confirm Functionalization and Glycosylation of Cannabinoids

The present inventor used liquid chromatography mass spectrometry to confirm functionalization and glycosylation of cannabinoids in the exemplary plant systems described herein. Specifically, mass spectrometry was performed on a quadrupole time-of-flight (QTOF) mass spectrometer (QTOF Micro, Waters, Manchester, UK) equipped with a Lockspray™ electrospray ion source coupled to a Waters Acquity UPLC system (Waters, Manchester, UK). Mass spectra were collected in the negative electrospray ionization mode (ESI−). The nebulization gas was set to 400 L/h at a temperature of 350° C., the cone gas was set to 15 L/H and the source temperature was set to 110° C. A capillary voltage and cone voltage were set to 2500 and 35 V, respectively. The MCP detector voltage was set to 2500 V. The Q-TOF micro MS acquisition rate was set to 1.0 s with a 0.1 s interscan delay. The scan range was from 100 to 1500 m/z. Data was collected in continuum mode. A lockmass solution of 50 ppm raffinose (503.1612 m/z) in 50:50 water:methanol was delivered at 20 μL/min through an auxiliary pump and acquired every 10 s during the MS acquisition. Separations were performed on a Waters HSS T3 C18 column (2.1×100 mm, particle size 1.8 μm) using a Waters ACQUITY UPLC System, equipped with an ACQUITY Binary Solvent Manager, ACQUITY Column Manager and ACQUITY Sample Manager (10 μL sample loop, partial loop injection mode, 5 μL injection volume, 4° C.). Eluents A and B were water and acetonitrile, respectively, both containing 0.1% formic acid. Elution was performed isocratically for 0.5 min at 10% eluent B and then linear gradient 100% eluent B in 14.5 min, and isocratically for 3 min at 100% eluent B. The column was re-equilibrated for 6 min. The flow rate was set to 250 μL/min and the column temperature was maintained at 30° C.

Example 21: Demonstrates Materials and Methods for Data Processing

Identification of individual cannabinoid analogs was performed by the present inventors, by their corresponding accurate mass shifts by Metabolynx (Waters Corp., Milford, USA). The method parameters for data processing were set as follows: retention time range 0.1-18 min, mass range 100-1500 Da, retention time tolerance 0.2 min, mass tolerance 0.05 Da, peak intensity threshold 14. Accurate mass measure of the continuum data was performed using the raffinose lock mass. Raw chromatographic data were additionally processed for extracted ion chromatogram sand peak area integration using Masslynx 4.1 (Waters Corp., Milford, USA). The select cannabinoids, CBGA and CBDA were identified and quantitated using certified reference materials (Cerilliant, Round Rock, Tex.). All chemical structures and physiochemical and constitutional properties were generated using ChemDoodle version 8.1.0 (IChemLabs™, Chesterfield, Va.).

Tables

TABLE 1 CBGA Biotransformed Products Molecular RRT to Expected Error Error Formula [M − Product Parent m/z Found m/z (mDa) (ppm) H]− R-OH 1 × Glycoside 0.58 537.2700 537.2703 −0.30 0.6 C28H41O10 2 × Glycoside 0.59 683.3279 683.3258 2.10 −3.1 C34H51O14 1 × O acetyl Glycoside 0.73 563.2856 563.2844 1.20 −2.1 C30H43O10 1 × Glycoside #1 0.74 521.2751 521.2734 1.70 −3.3 C28H41O9 R-OH #1 0.80 375.2171 375.2224 −5.30 14.1 C22H31O5 1 × Glycoside #2 0.81 521.2751 521.2727 2.40 −4.6 C28H41O9 R-OH #2 0.81 375.2171 375.2237 −6.60 17.6 C22H31O5 R-OH #3 0.94 375.2171 375.2192 −2.10 5.6 C22H31O5 CBGA 1.00 359.2222 359.2245 −2.30 6.4 C22H31O4 RRT Relative Retention Time to Parent Molecule R-OH Functionalized by addition of O atom

TABLE 2 CBDA Biotransformed Products Molecular RRT to Expected Found Error Error Formula Product Parent m/z m/z (mDa) (ppm) [M − H]− 2 × Glycoside 0.56 681.3122 681.3097 2.50 −3.7 C34H49O14 R-OH 1 × Glycoside 0.61 535.2543 535.2599 −5.60 10.5 C28H39O10 1 × Glycoside 0.71 519.2601 519.2594 0.70 1.3 C28H39O9 1 × O acetyl Glycoside 0.71 561.2700 561.2700 0.00 0 C30H41O10 R-OH #1 0.84 373.2015 373.2074 −5.90 15.8 C22H29O5 R-OH #2 0.87 373.2015 373.2034 −1.90 5.1 C22H29O5 R-OH #3 0.96 373.2015 373.2040 −2.50 −8 C22H29O5 CBDA 1.00 357.2066 357.2122 −5.60 15.7 C22H29O4 RRT Relative Retention Time to Parent Molecule R-OH Functionalized by addition of O atom′

TABLE 3 Forward and reverse primers for RT-PCR of CYP3A4 and P450 oxidoreductase Sequence CYP3A4 P450 oxidoreductase Primers for Forward TGCCTAATAAAGCTCCTCCTACT Forward GGAAGAGCTTTGGTTCCTATGT RT-PCR (SEQ ID NO. 51) (SEQ ID NO. 53) Reverse GCTCCTGAAACAGTTCCATCTC Reverse GCTCCCAATTCAGCAACAATATC (SEQ ID NO. 52) (SEQ ID NO. 54)

TABLE 4 Forward and reverse primers for CBDA synthase, UGT76G1 and ABCG2 Sequence CBDA synthase UGT76G1 ABCG2 Primers for Forward primer: Forward primer: Forward primer: RT-PCR ACATCACAATCACACA GATTGGAAGAACAAGCTT CCTTCAGGATTGTCAGGA AAACTAACAAAAG CAGGATTTCC GATG (SEQ ID NO. 55) (SEQ ID NO. 57) (SEQ ID NO. 59) Reverse primer: Reverse primer: Reverse primer: GGCCATAGTTTCTCAT CCATCCTGAATGAGTCCA GCAGGTCCATGAAACAT CAATGG AAAAGCTC CAATC (SEQ ID NO. 56) (SEQ ID NO. 58) (SEQ ID NO. 60)

TABLE 5 Trichome-targeted CBDA synthase (CBDAs), Trichome-targeted UGT and PM- targeted UTR1 Trichome-targeted Plasma membrane-targeted Sequence CBDAs Trichome-targeted UGT UTR1 Primers for Forward primer: Forward primer: Forward primer: RT-PCR AAAGATCAAAAGCAA AGTGCTCAACATTCTCCTT TTGTTCCTTAAACCTCGC GTTCTTCACTGT TTGGTT CTTTGAC (SEQ ID NO. 61) (SEQ ID NO. 63) (SEQ ID NO. 65) Reverse primer: Reverse primer: Reverse primer: CCATGCAGTTTGGCTA TCTGAAGCCAACATCAAC TCATTATGGAGCACTCCA TGAACATCT AATTCCA CTCTCTG (SEQ ID NO. 62) (SEQ ID NO. 64) (SEQ ID NO. 66)

TABLE 6 Cytosolic-targeted CBDA synthase (cytCBDAs), Cytosolic-targeted UGT (cytUGT) Sequence Cytosolic-targeted CBDA synthase Cytosolic-targeted UGT Primers for Forward primer: Forward primer: RT-PCR AAAGATCAAAAGCAAGTTCTTCACTGT AGAACTGGAAGAATCCGAACTGGAA (SEQ ID NO. 67) (SEQ ID NO. 69) Reverse primer: Reverse primer: ATAAACTTCTCCAAGGGTAGCTCCG AAATCATCGGGACACCTTCACAAAC (SEQ ID NO. 68) (SEQ ID NO. 70)

TABLE 7 Summary of results from glycosylation and functionalization experiments in N. benthamiana leaves. CBGA CBGA glycoside + CBDA CBDA CBGA glycoside acetylated CBDA glycoside Hydroxyl Agrobacterium Substrate (relative (relative (relative (relative (relative (relative Constructs fed amount) amount) amount) amount) amount) amount) Trichome CBDA CBGA + + + + ND ND synthase + trichome glycosyltransferase + PM-UTR1) + Myb/catalase* + P19 silencing supressor* Cytosolic CBDA CBGA + +++ +++ +++ ND ND synthase, glycosyltransferase and plasma membrane ABC transporter) + Myb/catalase + P19 silencing suppressor 201-SUS (cytosolic CBGA + +++ ++++ + + + CBDA synthase, glycosyltransferase and plasma membrane ABC transporter) CYP3A4 + oxidoreductase CBDA ND + ND +++ +++++ +++++ (cytochrome P450 with P450 oxidoreductase) Cytosolic CBDA CBGA ++++ +++++ +++++ ND ++ ++ synthase + cytosolic glycosyltransferase + Myb/catalase* + P19 silencing suppressor* P450/ CBGA + ++++ + ND ++ ++ MYBcatalase/cytosolic CBDA synthase, glycosyltransferase and plasma membrane ABC transporter No agrobacterium CBGA + + + ND ND ND (negative control) *Co-infiltration with and without construct was tested in different replicates

TABLE 8 Summary of results from glycosylation and functionalization experiments in Cannabis sativa leaves. CBDA CBDA CBDA glycoside Hydroxyl (relative (relative (relative Agrobacterium Constructs amount) amount) amount) Trichome CBDA synthase + ++ trace trace trichome glycosyltransferase + plasma membrane-targeted sugar transporter) + Myb/catalase cytosolic CBDA synthase, +++ ++++ +++++ cytosolic glycosyltransferase + Myb/catalase 201-SUS (cytosolic CBDA ++ ++ ++ synthase, glycosyltransferase and plasma membrane ABC transporter)

TABLE 9 Exemplary Glycosyltransferase sequence identification SEQ ID NO. Name Organism Type SEQ ID NO. 26 NtGT5a Nicotiana tabacum Amino Acid SEQ ID NO. 27 NtGT5a Nicotiana tabacum DNA SEQ ID NO. 28 NtGT5b Nicotiana tabacum Amino Acid SEQ ID NO. 29 NtGT5b Nicotiana tabacum DNA SEQ ID NO. 30 NtGT4 Nicotiana tabacum Amino Acid SEQ ID NO. 31 NtGT4 Nicotiana tabacum DNA SEQ ID NO. 32 NtGT1b Nicotiana tabacum Amino Acid SEQ ID NO. 33 NtGT1b Nicotiana tabacum DNA SEQ ID NO. 34 NtGT1a Nicotiana tabacum Amino Acid SEQ ID NO. 35 NtGT1a Nicotiana tabacum DNA SEQ ID NO. 36 NtGT3 Nicotiana tabacum Amino Acid SEQ ID NO. 37 NtGT3 Nicotiana tabacum DNA SEQ ID NO. 38 NtGT2 Nicotiana tabacum Amino Acid SEQ ID NO. 39 NtGT2 Nicotiana tabacum DNA

TABLE 10 Cannabinoid production cellular compartmentalization models. Different shaded columns and rows correspond to different exemplary expression constructs used. Catalase to degrade Cannabinoid Myb H₂O₂ production/ UDP Cannabinoid transcription from accumulation CBDA glycosyl ABC UDP glucose factor for CBDA system Synthase transferase transporter transporter cannabinoids Synthase Cytoplasmic Minus Required but No gene No gene Express Express accumulation trichome no targeting required required target change sequence Trichome No change Add No gene Target to Express Express (low pH) trichome required plasma synthesis target membrane sequence Cell Minus Required but Target to No gene Express Express suspension trichome no targeting plasma required cultures target change membrane sequence (PM)

REFERENCES

The following references are hereby incorporated in their entirety by reference:

-   [1] I von Ossowski, M R Mulvey, P A Leco, A Borys and P C Loewen, J.     Bacteriol. 1991, 173(2):514. -   [2] Behera, A., Behera, A., Mishra, S. C., Swain, S. K., &     Author, C. (2003). Cannabinoid glycosides: In vitro production of a     new class of cannabinoids with improved physicochemical properties.     Proc. Intl. Soc. Mag. Reson. Med (Vol. 14). -   [3] Holland, M. L., Lau, D. T. T., Allen, J. D., & Arnold, J. C.     (2009). The multidrug transporter ABCG2 (BCRP) is inhibited by     plant-derived cannabinoids. British Journal of Pharmacology, 152(5),     815-824. -   [4] Ivanchenco. M., Vejlupkova. Z., Quatrano. R. S.,     Fowler. J. E. (2000) Maize ROP7 GTPase contains a unique, CaaX     box-independent plasma membrane targeting signal. The Plant Journal,     (24)1, 79-90. -   [5] James M. Rini and Jeffrey D. Esko. Glycosyltransferases and     Glycan-Processing Enzymes. In: Essentials of Glycobiology     [Internet]. 3rd edition. -   [6] Marks, M. D., Tian, L., Wenger, J. P., Omburo, S. N.,     Soto-Fuentes, W., He, J., . . . Dixon, R. A. (2009). Identification     of candidate genes affecting Δ9-tetrahydrocannabinol biosynthesis in     Cannabis sativa. Journal of Experimental Botany, 60(13), 3715-3726. -   [7] Nagaya, S., Kawamura, K., Shinmyo, A., & Kato, K. (2010). The     HSP terminator of Arabidopsis thaliana increases gene expression in     plant cells. Plant and Cell Physiology, 51(2), 328-332. -   [8] Norambuena, L., Marchant, L., Berninsone, P., Hirschberg, C. B.,     Silva, H., & Orellana, A. (2002). Transport of UDP-galactose in     plants. Identification and functional characterization of AtUTr1, an     Arabidopsis thaliana UDP-galactose/UDP-glucose transporter. Journal     of Biological Chemistry, 277(36), 32923-32929. -   [9] Onofri, C., De Meijer, E. P. M., & Mandolino, G. (2015).     Sequence heterogeneity of cannabidiolic- and tetrahydrocannabinolic     acid-synthase in Cannabis sativa L. and its relationship with     chemical phenotype. Phytochemistry, 116(1), 57-68. -   [9] Priest, D. M., Ambrose, S. J., Vaistij, F. E., Elias, L.,     Higgins, G. S., Ross, A. R. S., . . . Bowles, D. J. (2006). Use of     the glucosyltransferase UGT71B6 to disturb abscisic acid homeostasis     in Arabidopsis thaliana. Plant Journal, 46(3), 492-502. -   [10] Siritunga, D., and Sayre, R. T. (2003). Generation of     cyanogen-free transgenic cassava. Planta 217, 367-373. -   [11] Sparkes, I. A., Runions, J., Kearns, A., & Hawes, C. (2006).     Rapid, transient expression of fluorescent fusion proteins in     tobacco plants and generation of stably transformed plants. Nature     Protocols, 1(4), 2019-2025. -   [13] Taura, F., Morimoto, S., & Shoyama, Y. (1996). Purification and     characterization of cannabidiolic-acid synthase from Cannabis     sativa L. Biochemical analysis of a novel enzyme that catalyzes the     oxidocyclization of Journal of Biological Chemistry, 27/(29),     17411-17416. -   [14] Taura, F., Sirikantaramas, S., Shoyama Y, Yoshikai K, Shoyama     Y, Morimoto S. (2007) Cannabidiolic-acid synthase, the     chemotype-determining enzyme in the fiber-type Cannabis sativa.     Febbs letters, 581(16), 2929-34. -   [15] Yoo, S. D., Cho, Y. H., & Sheen, J. (2007). Arabidopsis     mesophyll protoplasts: A versatile cell system for transient gene     expression analysis. Nature Protocols, 2(7), 1565-1572. -   [16] Matsui, T., Matsuura, H., Sawada, K., Takita, E., Kinjo, S.,     Takenami, S., . . . Kato, K. (2012). High level expression of     transgenes by use of 5′-untranslated region of the Arabidopsis     thaliana arabinogalactan-protein 21 gene in dicotyledons. Plant     Biotechnology, 29(3), 319-322.     https://doi.org/10.5511/plantbiotechnology.0.12.0322a -   [17] Murashige, T., and Skoog, F. (1962). A revised medium for rapid     growth and bioassays with tobacco tissue culture. Physiol. Plant.     15, 473-497. -   [18] Zipp, et al., Cannabinoid glycosides: In vitro production of a     new class of cannabinoids with improved physicochemical properties.     bioRxiv preprint -   [19] Mohamed, E. A., T. Iwaki, I. Munir, M. Tamoi, S. Shigeoka,     and A. Wadano. 2003. Overexpression of bacterial catalase in tomato     leaf chloroplasts enhances photo-oxidative stress tolerance. Plant     Cell Environ. 26:2037-2046. -   [20] Akhtar, M. T., 2013, Doctoral Thesis, Leiden University.     Cannabinoids and zebrafish. 2013 May 22. -   [21] Sayed Farag. Cannabinoids production in Cannabis sativa L.: An     in vitro approach. Thesis January 2014. -   [21] K, Watanabe, et al., Cytochrome P450 enzymes involved in the     metabolism of tetrahydrocannabinols and cannabinol by human hepatic     microsomes. Life Sciences. Volume 80, Issue 15, 20 Mar. 2007, Pages     1415-1419 [22] Flores-Sanchez I J. et al., Elicitation studies in     cell suspension cultures of Cannabis sativa L. J Biotechnol. 2009     Aug. 20; 143(2):157-68. -   [23] Stephen M. Stout & Nina M. Cimino (2013) Exogenous cannabinoids     as substrates, inhibitors, and inducers of human drug metabolizing     enzymes: a systematic review, Drug Metabolism Reviews, 46:1, 86-95     [24] Andre C M, Hausman J-F, Guerriero G. Cannabis sativa: The Plant     of the Thousand and One Molecules. Frontiers in Plant Science. 2016;     7:19. -   [25] Mahlberg Pl. et al., Accumulation of Cannabinoids in Glandular     Trichomes of Cannabis (Cannabaceae). Journal of Industrial Hemp     9(1):15-36. -   [25] Katalin S., et al., Mini Rev Med Chem. 2017; 17(13):1223-1291. -   [26] Sirikantaramas S., et al., Tetrahydrocannabinolic Acid     Synthase, the Enzyme Controlling Marijuana Psychoactivity, is     Secreted into the Storage Cavity of the Glandular Trichomes. Plant     and Cell Physiology, Volume 46, Issue 9, 1 Sep. 2005, Pages     1578-1582. -   [26] Schilmiller A L, Last R L, Pichersky E (2008) Harnessing plant     trichome biochemistry for the production of useful compounds. Plant     Journal 54: 702-711. -   [27] Matias-Hernandez, L. et al. AaMYB1 and its orthologue AtMYB61     affect terpene metabolism and trichome development in Artemisia     annua and Arabidopsis thaliana. Plant J. 2017; 90: 520-534.

SEQUENCE LISTINGS

As noted above, the instant application contains a full Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. The following sequences are further provided herewith and are hereby incorporated into the specification in their entirety:

SEQ ID NO. 1 DNA Cytochrome P450 (CYP3A4) Human ATGGCTTTGATTCCTGATTTGGCTATGGAAACTAGATTGTTGTTGGCTGTTTCATTGGTTTTGT TGTATTTGTATGGAACTCATTCACATGGATTGTTTAAAAAATTGGGAATTCCTGGACCTACTCC TTTGCCTTTTTTGGGAAATATTTTGTCATATCATAAAGGATTTTGCATGTTTGATATGGAATGC CATAAAAAATATGGAAAAGTTTGGGGATTTTATGATGGACAACAACCTGTTTTGGCTATTACTG ATCCTGATATGATTAAAACTGTTTTGGTTAAAGAATGCTATTCAGTTTTTACTAATAGAAGACC TTTTGGACCTGTTGGATTTATGAAATCAGCTATTTCAATTGCTGAAGATGAAGAATGGAAAAGA TTGAGATCATTGTTGTCACCTACTTTTACTTCAGGAAAATTGAAAGAAATGGTTCCTATTATTG CTCAATATGGAGATGTTTTGGTTAGAAATTTGAGAAGAGAAGCTGAAACTGGAAAACCTGTTAC TTTGAAAGATGTTTTTGGAGCTTATTCAATGGATGTTATTACTTCAACTTCATTTGGAGTTAAT ATTGATTCATTGAATAATCCTCAAGATCCTTTTGTTGAAAATACTAAAAAATTGTTGAGATTTG ATTTTTTGGATCCTTTTTTTTTGTCAATTACTGTTTTTCCTTTTTTGATTCCTATTTTGGAAGT TTTGAATATTTGCGTTTTTCCTAGAGAAGTTACTAATTTTTTGAGAAAATCAGTTAAAAGAATG AAAGAATCAAGATTGGAAGATACTCAAAAACATAGAGTTGATTTTTTGCAATTGATGATTGATT CACAAAATTCAAAAGAAACTGAATCACATAAAGCTTTGTCAGATTTGGAATTGGTTGCTCAATC AATTATTTTTATTTTTGCTGGATGCGAAACTACTTCATCAGTTTTGTCATTTATTATGTATGAA TTGGCTACTCATCCTGATGTTCAACAAAAATTGCAAGAAGAAATTGATGCTGTTTTGCCTAATA AAGCTCCTCCTACTTATGATACTGTTTTGCAAATGGAATATTTGGATATGGTTGTTAATGAAAC TTTGAGATTGTTTCCTATTGCTATGAGATTGGAAAGAGTTTGCAAAAAAGATGTTGAAATTAAT GGAATGTTTATTCCTAAAGGAGTTGTTGTTATGATTCCTTCATATGCTTTGCATAGAGATCCTA AATATTGGACTGAACCTGAAAAATTTTTGCCTGAAAGATTTTCAAAAAAAAATAAAGATAATAT TGATCCTTATATTTATACTCCTTTTGGATCAGGACCTAGAAATTGCATTGGAATGAGATTTGCT TTGATGAATATGAAATTGGCTTTGATTAGAGTTTTGCAAAATTTTTCATTTAAACCTTGCAAAG AAACTCAAATTCCTTTGAAATTGTCATTGGGAGGATTGTTGCAACCTGAAAAACCTGTTGTTTT GAAAGTTGAATCAAGAGATGGAACTGTTTCAGGAGCT SEQ ID NO. 2 Amino Acid Cytochrome P450 (CYP3A4) Human MALIPDLAMETRLLLAVSLVLLYLYGTHSHGLFKKLGIPGPTPLPFLGNILSYHKGFCMFDMEC HKKYGKVWGFYDGQQPVLAITDPDMIKTVLVKECYSVFTNRRPFGPVGFMKSAISIAEDEEWKR LRSLLSPTFTSGKLKEMVPIIAQYGDVLVRNLRREAETGKPVTLKDVFGAYSMDVITSTSFGVN IDSLNNPQDPFVENTKKLLRFDFLDPFFLSITVFPFLIPILEVLNICVFPREVTNFLRKSVKRM KESRLEDTQKHRVDFLQLMIDSQNSKETESHKALSDLELVAQSIIFIFAGCETTSSVLSFIMYE LATHPDVQQKLQEEIDAVLPNKAPPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEIN GMFIPKGVVVMIPSYALHRDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFA LMNMKLALIRVLQNFSFKPCKETQIPLKLSLGGLLQPEKPVVLKVESRDGTVSGA SEQ ID NO. 3 DNA P450 oxidoreductase gene (oxred) Human ATGATTAATATGGGAGATTCACATGTTGATACTTCATCAACTGTTTCAGAAGCTGTTGCTGAAG AAGTTTCATTGTTTTCAATGACTGATATGATTTTGTTTTCATTGATTGTTGGATTGTTGACTTA TTGGTTTTTGTTTAGAAAAAAAAAAGAAGAAGTTCCTGAATTTACTAAAATTCAAACTTTGACT TCATCAGTTAGAGAATCATCATTTGTTGAAAAAATGAAAAAAACTGGAAGAAATATTATTGTTT TTTATGGATCACAAACTGGAACTGCTGAAGAATTTGCTAATAGATTGTCAAAAGATGCTCATAG ATATGGAATGAGAGGAATGTCAGCTGATCCTGAAGAATATGATTTGGCTGATTTGTCATCATTG CCTGAAATTGATAATGCTTTGGTTGTTTTTTGCATGGCTACTTATGGAGAAGGAGATCCTACTG ATAATGCTCAAGATTTTTATGATTGGTTGCAAGAAACTGATGTTGATTTGTCAGGAGTTAAATT TGCTGTTTTTGGATTGGGAAATAAAACTTATGAACATTTTAATGCTATGGGAAAATATGTTGAT AAAAGATTGGAACAATTGGGAGCTCAAAGAATTTTTGAATTGGGATTGGGAGATGATGATGGAA ATTTGGAAGAAGATTTTATTACTTGGAGAGAACAATTTTGGTTGGCTGTTTGCGAACATTTTGG AGTTGAAGCTACTGGAGAAGAATCATCAATTAGACAATATGAATTGGTTGTTCATACTGATATT GATGCTGCTAAAGTTTATATGGGAGAAATGGGAAGATTGAAATCATATGAAAATCAAAAACCTC CTTTTGATGCTAAAAATCCTTTTTTGGCTGCTGTTACTACTAATAGAAAATTGAATCAAGGAAC TGAAAGACATTTGATGCATTTGGAATTGGATATTTCAGATTCAAAAATTAGATATGAATCAGGA GATCATGTTGCTGTTTATCCTGCTAATGATTCAGCTTTGGTTAATCAATTGGGAAAAATTTTGG GAGCTGATTTGGATGTTGTTATGTCATTGAATAATTTGGATGAAGAATCAAATAAAAAACATCC TTTTCCTTGCCCTACTTCATATAGAACTGCTTTGACTTATTATTTGGATATTACTAATCCTCCT AGAACTAATGTTTTGTATGAATTGGCTCAATATGCTTCAGAACCTTCAGAACAAGAATTGTTGA GAAAAATGGCTTCATCATCAGGAGAAGGAAAAGAATTGTATTTGTCATGGGTTGTTGAAGCTAG AAGACATATTTTGGCTATTTTGCAAGATTGCCCTTCATTGAGACCTCCTATTGATCATTTGTGC GAATTGTTGCCTAGATTGCAAGCTAGATATTATTCAATTGCTTCATCATCAAAAGTTCATCCTA ATTCAGTTCATATTTGCGCTGTTGTTGTTGAATATGAAACTAAAGCTGGAAGAATTAATAAAGG AGTTGCTACTAATTGGTTGAGAGCTAAAGAACCTGTTGGAGAAAATGGAGGAAGAGCTTTGGTT CCTATGTTTGTTAGAAAATCACAATTTAGATTGCCTTTTAAAGCTACTACTCCTGTTATTATGG TTGGACCTGGAACTGGAGTTGCTCCTTTTATTGGATTTATTCAAGAAAGAGCTTGGTTGAGACA ACAAGGAAAAGAAGTTGGAGAAACTTTGTTGTATTATGGATGCAGAAGATCAGATGAAGATTAT TTGTATAGAGAAGAATTGGCTCAATTTCATAGAGATGGAGCTTTGACTCAATTGAATGTTGCTT TTTCAAGAGAACAATCACATAAAGTTTATGTTCAACATTTGTTGAAACAAGATAGAGAACATTT GTGGAAATTGATTGAAGGAGGAGCTCATATTTATGTTTGCGGAGATGCTAGAAATATGGCTAGA GATGTTCAAAATACTTTTTATGATATTGTTGCTGAATTGGGAGCTATGGAACATGCTCAAGCTG TTGATTATATTAAAAAATTGATGACTAAAGGAAGATATTCATTGGATGTTTGGTCA SEQ ID NO. 4 Amino Acid P450 oxidoreductase Human MINMGDSHVDTSSTVSEAVAEEVSLFSMTDMILFSLIVGLLTYWFLFRKKKEEVPEFTKIQTLT SSVRESSFVEKMKKTGRNIIVFYGSQTGTAEEFANRLSKDAHRYGMRGMSADPEEYDLADLSSL PEIDNALVVFCMATYGEGDPTDNAQDFYDWLQETDVDLSGVKFAVFGLGNKTYEHFNAMGKYVD KRLEQLGAQRIFELGLGDDDGNLEEDFITWREQFWLAVCEHFGVEATGEESSIRQYELVVHTDI DAAKVYMGEMGRLKSYENQKPPFDAKNPFLAAVTTNRKLNQGTERHLMHLELDISDSKIRYESG DHVAVYPANDSALVNQLGKILGADLDVVMSLNNLDEESNKKHPFPCPTSYRTALTYYLDITNPP RTNVLYELAQYASEPSEQELLRKMASSSGEGKELYLSWVVEARRHILAILQDCPSLRPPIDHLC ELLPRLQARYYSIASSSKVHPNSVHICAVVVEYETKAGRINKGVATNWLRAKEPVGENGGRALV PMFVRKSQFRLPFKATTPVIMVGPGTGVAPFIGFIQERAWLRQQGKEVGETLLYYGCRRSDEDY LYREELAQFHRDGALTQLNVAFSREQSHKVYVQHLLKQDREHLWKLIEGGAHIYVCGDARNMAR DVQNTFYDIVAELGAMEHAQAVDYIKKLMTKGRYSLDVWS SEQ ID NO. 5 DNA cannabidiolic acid (CBDA) synthase Cannabis sativa ATGAATCCTCGAGAAAACTTCCTTAAATGCTTCTCGCAATATATTCCCAATAATGCAACAAATC TAAAACTCGTATACACTCAAAACAACCCATTGTATATGTCTGTCCTAAATTCGACAATACACAA TCTTAGATTCACCTCTGACACAACCCCAAAACCACTTGTTATCGTCACTCCTTCACATGTCTCT CATATCCAAGGCACTATTCTATGCTCCAAGAAAGTTGGCTTGCAGATTCGAACTCGAAGTGGTG GTCATGATTCTGAGGGCATGTCCTACATATCTCAAGTCCCATTTGTTATAGTAGACTTGAGAAA CATGCGTTCAATCAAAATAGATGTTCATAGCCAAACTGCATGGGTTGAAGCCGGAGCTACCCTT GGAGAAGTTTATTATTGGGTTAATGAGAAAAATGAGAATCTTAGTTTGGCGGCTGGGTATTGCC CTACTGTTTGCGCAGGTGGACACTTTGGTGGAGGAGGCTATGGACCATTGATGAGAAACTATGG CCTCGCGGCTGATAATATCATTGATGCACACTTAGTCAACGTTCATGGAAAAGTGCTAGATCGA AAATCTATGGGGGAAGATCTCTTTTGGGCTTTACGTGGTGGTGGAGCAGAAAGCTTCGGAATCA TTGTAGCATGGAAAATTAGACTGGTTGCTGTCCCAAAGTCTACTATGTTTAGTGTTAAAAAGAT CATGGAGATACATGAGCTTGTCAAGTTAGTTAACAAATGGCAAAATATTGCTTACAAGTATGAC AAAGATTTATTACTCATGACTCACTTCATAACTAGGAACATTACAGATAATCAAGGGAAGAATA AGACAGCAATACACACTTACTTCTCTTCAGTTTTCCTTGGTGGAGTGGATAGTCTAGTCGACTT GATGAACAAGAGTTTTCCTGAGTTGGGTATTAAAAAAACGGATTGCAGACAATTGAGCTGGATT GATACTATCATCTTCTATAGTGGTGTTGTAAATTACGACACTGATAATTTTAACAAGGAAATTT TGCTTGATAGATCCGCTGGGCAGAACGGTGCTTTCAAGATTAAGTTAGACTACGTTAAGAAACC AATTCCAGAATCTGTATTTGTCCAAATTTTGGAAAAATTATATGAAGAAGATATAGGAGCTGGG ATGTATGCGTTGTACCCTTACGGTGGTATAATGGATGAGATTTCAGAATCAGCAATTCCATTCC CTCATCGAGCTGGAATCTTGTATGAGTTATGGTACATATGTAGTTGGGAGAAGCAAGAAGATAA CGAAAAGCATCTAAACTGGATTAGAAATATTTATAACTTCATGACTCCTTATGTGTCCAAAAAT TCAAGATTGGCATATCTCAATTATAGAGACCTTGATATAGGAATAAATGATCCCAAGAATCCAA ATAATTACACACAAGCACGTATTTGGGGTGAGAAGTATTTTGGTAAAAATTTTGACAGGCTAGT AAAAGTGAAAACCCTGGTTGATCCCAATAACTTTTTTAGAAACGAACAAAGCATCCCACCTCAA CCACGGCATCGTCATTAA SEQ ID NO. 6 Amino Acid Cannabidiolic acid (CBDA) synthase Cannabis sativa MNPRENFLKCFSQYIPNNATNLKLVYTQNNPLYMSVLNSTIHNLRFTSDTTPKPLVIVTPSHVS HIQGTILCSKKVGLQIRTRSGGHDSEGMSYISQVPFVIVDLRNMRSIKIDVHSQTAWVEAGATL GEVYYWVNEKNENLSLAAGYCPTVCAGGHFGGGGYGPLMRNYGLAADNIIDAHLVNVHGKVLDR KSMGEDLFWALRGGGAESFGIIVAWKIRLVAVPKSTMFSVKKIMEIHELVKLVNKWQNIAYKYD KDLLLMTHFITRNITDNQGKNKTAIHTYFSSVFLGGVDSLVDLMNKSFPELGIKKTDCRQLSWI DTIIFYSGVVNYDTDNFNKEILLDRSAGQNGAFKIKLDYVKKPIPESVFVQILEKLYEEDIGAG MYALYPYGGIMDEISESAIPFPHRAGILYELWYICSWEKQEDNEKHLNWIRNIYNFMTPYVSKN SRLAYLNYRDLDIGINDPKNPNNYTQARIWGEKYFGKNFDRLVKVKTLVDPNNFFRNEQSIPPQ PRHRH SEQ ID NO. 7 DNA UDP glycosyltransferase 76G1 Stevia rebaudiana ATGGAAAATAAAACTGAAACTACTGTTAGAAGAAGAAGAAGAATTATTTTGTTTCCTGTTCCTT TTCAAGGACATATTAATCCTATTTTGCAATTGGCTAATGTTTTGTATTCAAAAGGATTTTCAAT TACTATTTTTCATACTAATTTTAATAAACCTAAAACTTCAAATTATCCTCATTTTACTTTTAGA TTTATTTTGGATAATGATCCTCAAGATGAAAGAATTTCAAATTTGCCTACTCATGGACCTTTGG CTGGAATGAGAATTCCTATTATTAATGAACATGGAGCTGATGAATTGAGAAGAGAATTGGAATT GTTGATGTTGGCTTCAGAAGAAGATGAAGAAGTTTCATGCTTGATTACTGATGCTTTGTGGTAT TTTGCTCAATCAGTTGCTGATTCATTGAATTTGAGAAGATTGGTTTTGATGACTTCATCATTGT TTAATTTTCATGCTCATGTTTCATTGCCTCAATTTGATGAATTGGGATATTTGGATCCTGATGA TAAAACTAGATTGGAAGAACAAGCTTCAGGATTTCCTATGTTGAAAGTTAAAGATATTAAATCA GCTTATTCAAATTGGCAAATTTTGAAAGAAATTTTGGGAAAAATGATTAAACAAACTAGAGCTT CATCAGGAGTTATTTGGAATTCATTTAAAGAATTGGAAGAATCAGAATTGGAAACTGTTATTAG AGAAATTCCTGCTCCTTCATTTTTGATTCCTTTGCCTAAACATTTGACTGCTTCATCATCATCA TTGTTGGATCATGATAGAACTGTTTTTCAATGGTTGGATCAACAACCTCCTTCATCAGTTTTGT ATGTTTCATTTGGATCAACTTCAGAAGTTGATGAAAAAGATTTTTTGGAAATTGCTAGAGGATT GGTTGATTCAAAACAATCATTTTTGTGGGTTGTTAGACCTGGATTTGTTAAAGGATCAACTTGG GTTGAACCTTTGCCTGATGGATTTTTGGGAGAAAGAGGAAGAATTGTTAAATGGGTTCCTCAAC AAGAAGTTTTGGCTCATGGAGCTATTGGAGCTTTTTGGACTCATTCAGGATGGAATTCAACTTT GGAATCAGTTTGCGAAGGAGTTCCTATGATTTTTTCAGATTTTGGATTGGATCAACCTTTGAAT GCTAGATATATGTCAGATGTTTTGAAAGTTGGAGTTTATTTGGAAAATGGATGGGAAAGAGGAG AAATTGCTAATGCTATTAGAAGAGTTATGGTTGATGAAGAAGGAGAATATATTAGACAAAATGC TAGAGTTTTGAAACAAAAAGCTGATGTTTCATTGATGAAAGGAGGATCATCATATGAATCATTG GAATCATTGGTTTCATATATTTCATCATTG SEQ ID NO. 8 Amino Acid UPD gycosyltransferase 76G1 Stevia rebaudiana MENKTETTVRRRRRIILFPVPFQGHINPILQLANVLYSKGFSITIFHTNFNKPKTSNYPHFTFR FILDNDPQDERISNLPTHGPLAGMRIPIINEHGADELRRELELLMLASEEDEEVSCLITDALWY FAQSVADSLNLRRLVLMTSSLFNFHAHVSLPQFDELGYLDPDDKTRLEEQASGFPMLKVKDIKS AYSNWQILKEILGKMIKQTRASSGVIWNSFKELEESELETVIREIPAPSFLIPLPKHLTASSSS LLDHDRTVFQWLDQQPPSSVLYVSFGSTSEVDEKDFLEIARGLVDSKQSFLWVVRPGFVKGSTW VEPLPDGFLGERGRIVKWVPQQEVLAHGAIGAFWTHSGWNSTLESVCEGVPMIFSDFGLDQPLN ARYMSDVLKVGVYLENGWERGEIANAIRRVMVDEEGEYIRQNARVLKQKADVSLMKGGSSYESL ESLVSYISSL SEQ ID NO. 9 DNA ABC transporter ABCG2 Human ATGTCATCATCAAATGTTGAAGTTTTTATTCCTGTTTCACAAGGAAATACTAATGGATTTCCTG CTACTGCTTCAAATGATTTGAAAGCTTTTACTGAAGGAGCTGTTTTGTCATTTCATAATATTTG CTATAGAGTTAAATTGAAATCAGGATTTTTGCCTTGCAGAAAACCTGTTGAAAAAGAAATTTTG TCAAATATTAATGGAATTATGAAACCTGGATTGAATGCTATTTTGGGACCTACTGGAGGAGGAA AATCATCATTGTTGGATGTTTTGGCTGCTAGAAAAGATCCTTCAGGATTGTCAGGAGATGTTTT GATTAATGGAGCTCCTAGACCTGCTAATTTTAAATGCAATTCAGGATATGTTGTTCAAGATGAT GTTGTTATGGGAACTTTGACTGTTAGAGAAAATTTGCAATTTTCAGCTGCTTTGAGATTGGCTA CTACTATGACTAATCATGAAAAAAATGAAAGAATTAATAGAGTTATTCAAGAATTGGGATTGGA TAAAGTTGCTGATTCAAAAGTTGGAACTCAATTTATTAGAGGAGTTTCAGGAGGAGAAAGAAAA AGAACTTCAATTGGAATGGAATTGATTACTGATCCTTCAATTTTGTTTTTGGATGAACCTACTA CTGGATTGGATTCATCAACTGCTAATGCTGTTTTGTTGTTGTTGAAAAGAATGTCAAAACAAGG AAGAACTATTATTTTTTCAATTCATCAACCTAGATATTCAATTTTTAAATTGTTTGATTCATTG ACTTTGTTGGCTTCAGGAAGATTGATGTTTCATGGACCTGCTCAAGAAGCTTTGGGATATTTTG AATCAGCTGGATATCATTGCGAAGCTTATAATAATCCTGCTGATTTTTTTTTGGATATTATTAA TGGAGATTCAACTGCTGTTGCTTTGAATAGAGAAGAAGATTTTAAAGCTACTGAAATTATTGAA CCTTCAAAACAAGATAAACCTTTGATTGAAAAATTGGCTGAAATTTATGTTAATTCATCATTTT ATAAAGAAACTAAAGCTGAATTGCATCAATTGTCAGGAGGAGAAAAAAAAAAAAAAATTACTGT TTTTAAAGAAATTTCATATACTACTTCATTTTGCCATCAATTGAGATGGGTTTCAAAAAGATCA TTTAAAAATTTGTTGGGAAATCCTCAAGCTTCAATTGCTCAAATTATTGTTACTGTTGTTTTGG GATTGGTTATTGGAGCTATTTATTTTGGATTGAAAAATGATTCAACTGGAATTCAAAATAGAGC TGGAGTTTTGTTTTTTTTGACTACTAATCAATGCTTTTCATCAGTTTCAGCTGTTGAATTGTTT GTTGTTGAAAAAAAATTGTTTATTCATGAATATATTTCAGGATATTATAGAGTTTCATCATATT TTTTGGGAAAATTGTTGTCAGATTTGTTGCCTATGAGAATGTTGCCTTCAATTATTTTTACTTG CATTGTTTATTTTATGTTGGGATTGAAAGCTAAAGCTGATGCTTTTTTTGTTATGATGTTTACT TTGATGATGGTTGCTTATTCAGCTTCATCAATGGCTTTGGCTATTGCTGCTGGACAATCAGTTG TTTCAGTTGCTACTTTGTTGATGACTATTTGCTTTGTTTTTATGATGATTTTTTCAGGATTGTT GGTTAATTTGACTACTATTGCTTCATGGTTGTCATGGTTGCAATATTTTTCAATTCCTAGATAT GGATTTACTGCTTTGCAACATAATGAATTTTTGGGACAAAATTTTTGCCCTGGATTGAATGCTA CTGGAAATAATCCTTGCAATTATGCTACTTGCACTGGAGAAGAATATTTGGTTAAACAAGGAAT TGATTTGTCACCTTGGGGATTGTGGAAAAATCATGTTGCTTTGGCTTGCATGATTGTTATTTTT TTGACTATTGCTTATTTGAAATTGTTGTTTTTGAAAAAATATTCA SEQ ID NO. 10 Amino Acid ABC transporter ABCG2 Human MSSSNVEVFIPVSQGNTNGFPATASNDLKAFTEGAVLSFHNICYRVKLKSGFLPCRKPVEKEIL SNINGIMKPGLNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRPANFKCNSGYVVQDD VVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERK RTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFDSL TLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDIINGDSTAVALNREEDFKATEIIE PSKQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEKKKKITVFKEISYTTSFCHQLRWVSKRS FKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAVELF VVEKKLFIHEYISGYYRVSSYFLGKLLSDLLPMRMLPSIIFTCIVYFMLGLKAKADAFFVMMFT LMMVAYSASSMALAIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRY GFTALQHNEFLGQNFCPGLNATGNNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIF LTIAYLKLLFLKKYS SEQ ID NO. 11 DNA MYB12-like Cannabis ATGAAGAAGAACAAATCAACTAGTAATAATAAGAACAACAACAGTAATAATATCATCAAAAACG ACATCGTATCATCATCATCATCAACAACAACAACATCATCAACAACTACAGCAACATCATCATT TCATAATGAGAAAGTTACTGTCAGTACTGATCATATTATTAATCTTGATGATAAGCAGAAACGA CAATTATGTCGTTGTCGTTTAGAAAAAGAAGAAGAAGAAGAAGGAAGTGGTGGTTGTGGTGAGA CAGTAGTAATGATGCTAGGGTCAGTATCTCCTGCTGCTGCTACTGCTGCTGCAGCTGGGGGCTC ATCAAGTTGTGATGAAGACATGTTGGGTGGTCATGATCAACTGTTGTTGTTGTGTTGTTCTGAG AAAAAAACGACAGAAATTTCATCAGTGGTGAACTTTAATAATAATAATAATAATAATAAGGAAA ATGGTGACGAAGTTTCAGGACCGTACGATTATCATCATCATAAAGAAGAGGAAGAAGAAGAAGA AGAAGATGAAGCATCTGCATCAGTAGCAGCTGTTGATGAAGGGATGTTGTTGTGCTTTGATGAC ATAATAGATAGCCACTTGCTAAATCCAAATGAGGTTTTGACTTTAAGAGAAGATAGCCATAATG AAGGTGGGGCAGCTGATCAGATTGACAAGACTACTTGTAATAATACTACTATTACTACTAATGA TGATTATAACAATAACTTGATGATGTTGAGCTGCAATAATAACGGAGATTATGTTATTAGTGAT GATCATGATGATCAGTACTGGATAGACGACGTCGTTGGAGTTGACTTTTGGAGTTGGGAGAGTT CGACTACTACTGTTATTACCCAAGAACAAGAACAAGAACAAGATCAAGTTCAAGAACAGAAGAA TATGTGGGATAATGAGAAAGAGAAACTGTTGTCTTTGCTATGGGATAATAGTGATAACAGCAGC AGTTGGGAGTTACAAGATAAAAGCAATAATAATAATAATAATAATGTTCCTAACAAATGTCAAG AGATTACCTCTGATAAAGAAAATGCTATGGTTGCATGGCTTCTCTCCTGA SEQ ID NO. 12 Amino Acid MYB12 Cannabis MKKNKSTSNNKNNNSNNIIKNDIVSSSSSTTTTSSTTTATSSFHNEKVTVSTDHIINLDDKQKR QLCRCRLEKEEEEEGSGGCGETVVMMLGSVSPAAATAAAAGGSSSCDEDMLGGHDQLLLLCCSE KKTTEISSVVNFNNNNNNNKENGDEVSGPYDYHHHKEEEEEEEEDEASASVAAVDEGMLLCFDD IIDSHLLNPNEVLTLREDSHNEGGAADQIDKTTCNNTTITTNDDYNNNLMMLSCNNNGDYVISD DHDDQYWIDDVVGVDFWSWESSTTTVITQEQEQEQDQVQEQKNMWDNEKEKLLSLLWDNSDNSS SWELQDKSNNNNNNNVPNKCQEITSDKENAMVAWLLS SEQ ID NO. 13 DNA Catalase Arabidopsis thaliana ATGGATCCTTATAAATATAGACCTGCTTCATCATATAATTCACCTTTTTTTACTACTAATTCAG GAGCTCCTGTTTGGAATAATAATTCATCAATGACTGTTGGACCTAGAGGATTGATTTTGTTGGA AGATTATCATTTGGTTGAAAAATTGGCTAATTTTGATAGAGAAAGAATTCCTGAAAGAGTTGTT CATGCTAGAGGAGCTTCAGCTAAAGGATTTTTTGAAGTTACTCATGATATTTCAAATTTGACTT GCGCTGATTTTTTGAGAGCTCCTGGAGTTCAAACTCCTGTTATTGTTAGATTTTCAACTGTTAT TCATGCTAGAGGATCACCTGAAACTTTGAGAGATCCTAGAGGATTTGCTGTTAAATTTTATACT AGAGAAGGAAATTTTGATTTGGTTGGAAATAATTTTCCTGTTTTTTTTATTAGAGATGGAATGA AATTTCCTGATATTGTTCATGCTTTGAAACCTAATCCTAAATCACATATTCAAGAAAATTGGAG AATTTTGGATTTTTTTTCACATCATCCTGAATCATTGAATATGTTTACTTTTTTGTTTGATGAT ATTGGAATTCCTCAAGATTATAGACATATGGATGGATCAGGAGTTAATACTTATATGTTGATTA ATAAAGCTGGAAAAGCTCATTATGTTAAATTTCATTGGAAACCTACTTGCGGAGTTAAATCATT GTTGGAAGAAGATGCTATTAGATTGGGAGGAACTAATCATTCACATGCTACTCAAGATTTGTAT GATTCAATTGCTGCTGGAAATTATCCTGAATGGAAATTGTTTATTCAAATTATTGATCCTGCTG ATGAAGATAAATTTGATTTTGATCCTTTGGATGTTACTAAAACTTGGCCTGAAGATATTTTGCC TTTGCAACCTGTTGGAAGAATGGTTTTGAATAAAAATATTGATAATTTTTTTGCTGAAAATGAA CAATTGGCTTTTTGCCCTGCTATTATTGTTCCTGGAATTCATTATTCAGATGATAAATTGTTGC AAACTAGAGTTTTTTCATATGCTGATACTCAAAGACATAGATTGGGACCTAATTATTTGCAATT GCCTGTTAATGCTCCTAAATGCGCTCATCATAATAATCATCATGAAGGATTTATGAATTTTATG CATAGAGATGAAGAAGTTAATTATTTTCCTTCAAGATATGATCAAGTTAGACATGCTGAAAAAT ATCCTACTCCTCCTGCTGTTTGCTCAGGAAAAAGAGAAAGATGCATTATTGAAAAAGAAAATAA TTTTAAAGAACCTGGAGAAAGATATAGAACTTTTACTCCTGAAAGACAAGAAAGATTTATTCAA AGATGGATTGATGCTTTGTCAGATCCTAGAATTACTCATGAAATTAGATCAATTTGGATTTCAT ATTGGTCACAAGCTGATAAATCATTGGGACAAAAATTGGCTTCAAGATTGAATGTTAGACCTTC AATT SEQ ID NO. 14 Amino Acid Catalase Arabidopsis thaliana MDPYKYRPASSYNSPFFTTNSGAPVWNNNSSMTVGPRGLILLEDYHLVEKLANFDRERIPERVV HARGASAKGFFEVTHDISNLTCADFLRAPGVQTPVIVRFSTVIHARGSPETLRDPRGFAVKFYT REGNFDLVGNNFPVFFIRDGMKFPDIVHALKPNPKSHIQENWRILDFFSHHPESLNMFTFLFDD IGIPQDYRHMDGSGVNTYMLINKAGKAHYVKFHWKPTCGVKSLLEEDAIRLGGTNHSHATQDLY DSIAAGNYPEWKLFIQIIDPADEDKFDFDPLDVTKTWPEDILPLQPVGRMVLNKNIDNFFAENE QLAFCPAIIVPGIHYSDDKLLQTRVFSYADTQRHRLGPNYLQLPVNAPKCAHHNNHHEGFMNFM HRDEEVNYFPSRYDQVRHAEKYPTPPAVCSGKRERCIIEKENNFKEPGERYRTFTPERQERFIQ RWIDALSDPRITHEIRSIWISYWSQADKSLGQKLASRLNVRPSI SEQ ID NO. 15 DNA Catalase HPII (KatE) Escherichia coli ATGTCGCAACATAACGAAAAGAACCCACATCAGCACCAGTCACCACTACACGATTCCAGCGAAG CGAAACCGGGGATGGACTCACTGGCACCTGAGGACGGCTCTCATCGTCCAGCGGCTGAACCAAC ACCGCCAGGTGCACAACCTACCGCCCCAGGGAGCCTGAAAGCCCCTGATACGCGTAACGAAAAA CTTAATTCTCTGGAAGACGTACGCAAAGGCAGTGAAAATTATGCGCTGACCACTAATCAGGGCG TGCGCATCGCCGACGATCAAAACTCACTGCGTGCCGGTAGCCGTGGTCCAACGCTGCTGGAAGA TTTTATTCTGCGCGAGAAAATCACCCACTTTGACCATGAGCGCATTCCGGAACGTATTGTTCAT GCACGCGGATCAGCCGCTCACGGTTATTTCCAGCCATATAAAAGCTTAAGCGATATTACCAAAG CGGATTTCCTCTCAGATCCGAACAAAATCACCCCAGTATTTGTACGTTTCTCTACCGTTCAGGG TGGTGCTGGCTCTGCTGATACCGTGCGTGATATCCGTGGCTTTGCCACCAAGTTCTATACCGAA GAGGGTATTTTTGACCTCGTTGGCAATAACACGCCAATCTTCTTTATCCAGGATGCGCATAAAT TCCCCGATTTTGTTCATGCGGTAAAACCAGAACCGCACTGGGCAATTCCACAAGGGCAAAGTGC CCACGATACTTTCTGGGATTATGTTTCTCTGCAACCTGAAACTCTGCACAACGTGATGTGGGCG ATGTCGGATCGCGGCATCCCCCGCAGTTACCGCACCATGGAAGGCTTCGGTATTCACACCTTCC GCCTGATTAATGCCGAAGGGAAGGCAACGTTTGTACGTTTCCACTGGAAACCACTGGCAGGTAA AGCCTCACTCGTTTGGGATGAAGCACAAAAACTCACCGGACGTGACCCGGACTTCCACCGCCGC GAGTTGTGGGAAGCCATTGAAGCAGGCGATTTTCCGGAATACGAACTGGGCTTCCAGTTGATTC CTGAAGAAGATGAATTCAAGTTCGACTTCGATCTTCTCGATCCAACCAAACTTATCCCGGAAGA ACTGGTGCCCGTTCAGCGTGTCGGCAAAATGGTGCTCAATCGCAACCCGGATAACTTCTTTGCT GAAAACGAACAGGCGGCTTTCCATCCTGGGCATATCGTGCCGGGACTGGACTTCACCAACGATC CGCTGTTGCAGGGACGTTTGTTCTCCTATACCGATACACAAATCAGTCGTCTTGGTGGGCCGAA TTTCCATGAGATTCCGATTAACCGTCCGACCTGCCCTTACCATAATTTCCAGCGTGACGGCATG CATCGCATGGGGATCGACACTAACCCGGCGAATTACGAACCGAACTCGATTAACGATAACTGGC CGCGCGAAACACCGCCGGGGCCGAAACGCGGCGGTTTTGAATCATACCAGGAGCGCGTGGAAGG CAATAAAGTTCGCGAGCGCAGCCCATCGTTTGGCGAATATTATTCCCATCCGCGTCTGTTCTGG CTAAGTCAGACGCCATTTGAGCAGCGCCATATTGTCGATGGTTTCAGTTTTGAGTTAAGCAAAG TCGTTCGTCCGTATATTCGTGAGCGCGTTGTTGACCAGCTGGCGCATATTGATCTCACTCTGGC CCAGGCGGTGGCGAAAAATCTCGGTATCGAACTGACTGACGACCAGCTGAATATCACCCCACCT CCGGACGTCAACGGTCTGAAAAAGGATCCATCCTTAAGTTTGTACGCCATTCCTGACGGTGATG TGAAAGGTCGCGTGGTAGCGATTTTACTTAATGATGAAGTGAGATCGGCAGACCTTCTGGCCAT TCTCAAGGCGCTGAAGGCCAAAGGCGTTCATGCCAAACTGCTCTACTCCCGAATGGGTGAAGTG ACTGCGGATGACGGTACGGTGTTGCCTATAGCCGCTACCTTTGCCGGTGCACCTTCGCTGACGG TCGATGCGGTCATTGTCCCTTGCGGCAATATCGCGGATATCGCTGACAACGGCGATGCCAACTA CTACCTGATGGAAGCCTACAAACACCTTAAACCGATTGCGCTGGCGGGTGACGCGCGCAAGTTT AAAGCAACAATCAAGATCGCTGACCAGGGTGAAGAAGGGATTGTGGAAGCTGACAGCGCTGACG GTAGTTTTATGGATGAACTGCTAACGCTGATGGCAGCACACCGCGTGTGGTCACGCATTCCTAA GATTGACAAAATTCCTGCCTGA SEQ ID NO. 16 Amino Acid Catalase HPII (KatE) Escherichia coli MSQHNEKNPHQHQSPLHDSSEAKPGMDSLAPEDGSHRPAAEPTPPGAQPTAPGSLKAPDTRNEK LNSLEDVRKGSENYALTTNQGVRIADDQNSLRAGSRGPTLLEDFILREKITHFDHERIPERIVH ARGSAAHGYFQPYKSLSDITKADFLSDPNKITPVFVRFSTVQGGAGSADTVRDIRGFATKFYTE EGIFDLVGNNTPIFFIQDAHKFPDFVHAVKPEPHWAIPQGQSAHDTFWDYVSLQPETLHNVMWA MSDRGIPRSYRTMEGFGIHTFRLINAEGKATFVRFHWKPLAGKASLVWDEAQKLTGRDPDFHRR ELWEAIEAGDFPEYELGFQLIPEEDEFKFDFDLLDPTKLIPEELVPVQRVGKMVLNRNPDNFFA ENEQAAFHPGHIVPGLDFTNDPLLQGRLFSYTDTQISRLGGPNFHEIPINRPTCPYHNFQRDGM HRMGIDTNPANYEPNSINDNWPRETPPGPKRGGFESYQERVEGNKVRERSPSFGEYYSHPRLFW LSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLAHIDLTLAQAVAKNLGIELTDDQLNITPP PDVNGLKKDPSLSLYAIPDGDVKGRVVAILLNDEVRSADLLAILKALKAKGVHAKLLYSRMGEV TADDGTVLPIAATFAGAPSLTVDAVIVPCGNIADIADNGDANYYLMEAYKHLKPIALAGDARKF KATIKIADQGEEGIVEADSADGSFMDELLTLMAAHRVWSRIPKIDKIPA SEQ ID NO. 17 DNA Trichome-targeted CBDA synthase Cannabis ATGAAGTGCTCAACATTCTCCTTTTGGTTTGTTTGCAAGATAATATTTTTCTTTTTCTCATTCA ATATCCAAACTTCCATTGCTAATCCTCGAGAAAACTTCCTTAAATGCTTCTCGCAATATATTCC CAATAATGCAACAAATCTAAAACTCGTATACACTCAAAACAACCCATTGTATATGTCTGTCCTA AATTCGACAATACACAATCTTAGATTCACCTCTGACACAACCCCAAAACCACTTGTTATCGTCA CTCCTTCACATGTCTCTCATATCCAAGGCACTATTCTATGCTCCAAGAAAGTTGGCTTGCAGAT TCGAACTCGAAGTGGTGGTCATGATTCTGAGGGCATGTCCTACATATCTCAAGTCCCATTTGTT ATAGTAGACTTGAGAAACATGCGTTCAATCAAAATAGATGTTCATAGCCAAACTGCATGGGTTG AAGCCGGAGCTACCCTTGGAGAAGTTTATTATTGGGTTAATGAGAAAAATGAGAATCTTAGTTT GGCGGCTGGGTATTGCCCTACTGTTTGCGCAGGTGGACACTTTGGTGGAGGAGGCTATGGACCA TTGATGAGAAACTATGGCCTCGCGGCTGATAATATCATTGATGCACACTTAGTCAACGTTCATG GAAAAGTGCTAGATCGAAAATCTATGGGGGAAGATCTCTTTTGGGCTTTACGTGGTGGTGGAGC AGAAAGCTTCGGAATCATTGTAGCATGGAAAATTAGACTGGTTGCTGTCCCAAAGTCTACTATG TTTAGTGTTAAAAAGATCATGGAGATACATGAGCTTGTCAAGTTAGTTAACAAATGGCAAAATA TTGCTTACAAGTATGACAAAGATTTATTACTCATGACTCACTTCATAACTAGGAACATTACAGA TAATCAAGGGAAGAATAAGACAGCAATACACACTTACTTCTCTTCAGTTTTCCTTGGTGGAGTG GATAGTCTAGTCGACTTGATGAACAAGAGTTTTCCTGAGTTGGGTATTAAAAAAACGGATTGCA GACAATTGAGCTGGATTGATACTATCATCTTCTATAGTGGTGTTGTAAATTACGACACTGATAA TTTTAACAAGGAAATTTTGCTTGATAGATCCGCTGGGCAGAACGGTGCTTTCAAGATTAAGTTA GACTACGTTAAGAAACCAATTCCAGAATCTGTATTTGTCCAAATTTTGGAAAAATTATATGAAG AAGATATAGGAGCTGGGATGTATGCGTTGTACCCTTACGGTGGTATAATGGATGAGATTTCAGA ATCAGCAATTCCATTCCCTCATCGAGCTGGAATCTTGTATGAGTTATGGTACATATGTAGTTGG GAGAAGCAAGAAGATAACGAAAAGCATCTAAACTGGATTAGAAATATTTATAACTTCATGACTC CTTATGTGTCCAAAAATCCAAGATTGGCATATCTCAATTATAGAGACCTTGATATAGGAATAAA TGATCCCAAGAATCCAAATAATTACACACAAGCACGTATTTGGGGTGAGAAGTATTTTGGTAAA AATTTTGACAGGCTAGTAAAAGTGAAAACCCTGGTTGATCCCAATAACTTTTTTAGAAACGAAC AAAGCATCCCACCTCTACCACGGCATCGTCATTAA SEQ ID NO. 18 Amino Acid Trichome-targeted CBDA synthase Cannabis MKCSTFSFWFVCKIIFFFFSFNIQTSIANPRENFLKCFSQYIPNNATNLKLVYTQNNPLYMSVL NSTIHNLRFTSDTTPKPLVIVTPSHVSHIQGTILCSKKVGLQIRTRSGGHDSEGMSYISQVPFV IVDLRNMRSIKIDVHSQTAWVEAGATLGEVYYWVNEKNENLSLAAGYCPTVCAGGHFGGGGYGP LMRNYGLAADNIIDAHLVNVHGKVLDRKSMGEDLFWALRGGGAESFGIIVAWKIRLVAVPKSTM FSVKKIMEIHELVKLVNKWQNIAYKYDKDLLLMTHFITRNITDNQGKNKTAIHTYFSSVFLGGV DSLVDLMNKSFPELGIKKTDCRQLSWIDTIIFYSGVVNYDTDNFNKEILLDRSAGQNGAFKIKL DYVKKPIPESVFVQILEKLYEEDIGAGMYALYPYGGIMDEISESAIPFPHRAGILYELWYICSW EKQEDNEKHLNWIRNIYNFMTPYVSKNPRLAYLNYRDLDIGINDPKNPNNYTQARIWGEKYFGK NFDRLVKVKTLVDPNNFFRNEQSIPPLPRHRH SEQ ID NO. 19 DNA Trichome-targeted UDP glycosyltransferase 76G1 Stevia rebaudiana ATGAAGTGCTCAACATTCTCCTTTTGGTTTGTTTGCAAGATAATATTTTTCTTTTTCTCATTCA ATATCCAAACTTCCATTGCTAATCCTCGAGAAAATAAAACTGAAACTACTGTTAGAAGAAGAAG AAGAATTATTTTGTTTCCTGTTCCTTTTCAAGGACATATTAATCCTATTTTGCAATTGGCTAAT GTTTTGTATTCAAAAGGATTTTCAATTACTATTTTTCATACTAATTTTAATAAACCTAAAACTT CAAATTATCCTCATTTTACTTTTAGATTTATTTTGGATAATGATCCTCAAGATGAAAGAATTTC AAATTTGCCTACTCATGGACCTTTGGCTGGAATGAGAATTCCTATTATTAATGAACATGGAGCT GATGAATTGAGAAGAGAATTGGAATTGTTGATGTTGGCTTCAGAAGAAGATGAAGAAGTTTCAT GCTTGATTACTGATGCTTTGTGGTATTTTGCTCAATCAGTTGCTGATTCATTGAATTTGAGAAG ATTGGTTTTGATGACTTCATCATTGTTTAATTTTCATGCTCATGTTTCATTGCCTCAATTTGAT GAATTGGGATATTTGGATCCTGATGATAAAACTAGATTGGAAGAACAAGCTTCAGGATTTCCTA TGTTGAAAGTTAAAGATATTAAATCAGCTTATTCAAATTGGCAAATTTTGAAAGAAATTTTGGG AAAAATGATTAAACAAACTAGAGCTTCATCAGGAGTTATTTGGAATTCATTTAAAGAATTGGAA GAATCAGAATTGGAAACTGTTATTAGAGAAATTCCTGCTCCTTCATTTTTGATTCCTTTGCCTA AACATTTGACTGCTTCATCATCATCATTGTTGGATCATGATAGAACTGTTTTTCAATGGTTGGA TCAACAACCTCCTTCATCAGTTTTGTATGTTTCATTTGGATCAACTTCAGAAGTTGATGAAAAA GATTTTTTGGAAATTGCTAGAGGATTGGTTGATTCAAAACAATCATTTTTGTGGGTTGTTAGAC CTGGATTTGTTAAAGGATCAACTTGGGTTGAACCTTTGCCTGATGGATTTTTGGGAGAAAGAGG AAGAATTGTTAAATGGGTTCCTCAACAAGAAGTTTTGGCTCATGGAGCTATTGGAGCTTTTTGG ACTCATTCAGGATGGAATTCAACTTTGGAATCAGTTTGCGAAGGAGTTCCTATGATTTTTTCAG ATTTTGGATTGGATCAACCTTTGAATGCTAGATATATGTCAGATGTTTTGAAAGTTGGAGTTTA TTTGGAAAATGGATGGGAAAGAGGAGAAATTGCTAATGCTATTAGAAGAGTTATGGTTGATGAA GAAGGAGAATATATTAGACAAAATGCTAGAGTTTTGAAACAAAAAGCTGATGTTTCATTGATGA AAGGAGGATCATCATATGAATCATTGGAATCATTGGTTTCATATATTTCATCATTGTAA SEQ ID NO. 20 Amino Acid Trichome-targeted UDP glycosyltransferase 76G1 Stevia rebaudiana MKCSTFSFWFVCKIIFFFFSFNIQTSIANPRENKTETTVRRRRRIILFPVPFQGHINPILQLAN VLYSKGFSITIFHTNFNKPKTSNYPHFTFRFILDNDPQDERISNLPTHGPLAGMRIPIINEHGA DELRRELELLMLASEEDEEVSCLITDALWYFAQSVADSLNLRRLVLMTSSLFNFHAHVSLPQFD ELGYLDPDDKTRLEEQASGFPMLKVKDIKSAYSNWQILKEILGKMIKQTRASSGVIWNSFKELE ESELETVIREIPAPSFLIPLPKHLTASSSSLLDHDRTVFQWLDQQPPSSVLYVSFGSTSEVDEK DFLEIARGLVDSKQSFLWVVRPGFVKGSTWVEPLPDGFLGERGRIVKWVPQQEVLAHGAIGAFW THSGWNSTLESVCEGVPMIFSDFGLDQPLNARYMSDVLKVGVYLENGWERGEIANAIRRVMVDE EGEYIRQNARVLKQKADVSLMKGGSSYESLESLVSYISSL SEQ ID NO. 21 DNA PM-UTR1 Arabidopsis thaliana ATGGAGGTCCATGGCTCCGGATTCCGTCGAATTCTGTTGTTGGCGTTGTGTATCTCCGGGATCT GGTCCGCCTACATCTACCAAGGCGTTCTTCAAGAGACTCTGTCCACGAAGAGATTTGGTCCAGA TGAGAAGAGGTTCGAGCATCTTGCATTCTTGAACTTAGCTCAAAGTGTAGTCTGCTTGATCTGG TCTTATATAATGATCAAGCTCTGGTCAAATGCTGGTAACGGTGGAGCACCATGGTGGACGTATT GGAGTGCAGGCATTACTAATACAATTGGTCCTGCCATGGGAATTGAAGCCTTGAAGTATATCAG TTATCCAGCTCAGGTTTTGGCAAAATCGTCAAAAATGATTCCAGTTATGCTAATGGGAACTTTA GTTTACGGAATAAGATACACTTTCCCTGAATACATGTGCACCTTTCTTGTCGCTGGAGGAGTAT CCATCTTTGCTCTTCTTAAGACAAGCTCTAAGACAATTAGCAAGCTAGCACATCCAAATGCTCC CCTCGGTTACGCACTTTGTTCCTTAAACCTCGCCTTTGACGGATTCACAAATGCCACACAAGAC TCCATTGCCTCAAGGTACCCAAAAACCGAAGCGTGGGACATAATGCTGGGAATGAACTTATGGG GCACAATATACAACATTATCTACATGTTTGGCTTGCCACAAGGGATGGATTCGAAGCAATTCAG TTCTGTAAGCTACACCCGGAAGCGGCATGGGACATTCTAAAGTATTGTATATGCGGTGCCGTGG GACAAAACTTCATCTTCATGACAATAAGTAACTTCGGGTCACTAGCTAACACGACCATAACCAC GACCAGGAAGTTTGTTAGCATTGTTGTATCATCAGTAATGAGCGGAAATCCATTGTCGTTGAAG CAATGGGGATGTGTTTCGATGGTCTTTGGTGGTTTGGCATATCAAATTTATCTTAAATGGAAGA AATTGCAGAGAGTGGAGTGCTCCATAATGAACTTAATGTGTGGGTCTACCTGCGCCGCTTGA SEQ ID NO. 22 DNA Cytostolic CBDA synthase (cytCBDAs) Cannabis sativa ATGAATCCTCGAGAAAACTTCCTTAAATGCTTCTCGCAATATATTCCCAATAATGCAACAAATC TAAAACTCGTATACACTCAAAACAACCCATTGTATATGTCTGTCCTAAATTCGACAATACACAA TCTTAGATTCACCTCTGACACAACCCCAAAACCACTTGTTATCGTCACTCCTTCACATGTCTCT CATATCCAAGGCACTATTCTATGCTCCAAGAAAGTTGGCTTGCAGATTCGAACTCGAAGTGGTG GTCATGATTCTGAGGGCATGTCCTACATATCTCAAGTCCCATTTGTTATAGTAGACTTGAGAAA CATGCGTTCAATCAAAATAGATGTTCATAGCCAAACTGCATGGGTTGAAGCCGGAGCTACCCTT GGAGAAGTTTATTATTGGGTTAATGAGAAAAATGAGAATCTTAGTTTGGCGGCTGGGTATTGCC CTACTGTTTGCGCAGGTGGACACTTTGGTGGAGGAGGCTATGGACCATTGATGAGAAACTATGG CCTCGCGGCTGATAATATCATTGATGCACACTTAGTCAACGTTCATGGAAAAGTGCTAGATCGA AAATCTATGGGGGAAGATCTCTTTTGGGCTTTACGTGGTGGTGGAGCAGAAAGCTTCGGAATCA TTGTAGCATGGAAAATTAGACTGGTTGCTGTCCCAAAGTCTACTATGTTTAGTGTTAAAAAGAT CATGGAGATACATGAGCTTGTCAAGTTAGTTAACAAATGGCAAAATATTGCTTACAAGTATGAC AAAGATTTATTACTCATGACTCACTTCATAACTAGGAACATTACAGATAATCAAGGGAAGAATA AGACAGCAATACACACTTACTTCTCTTCAGTTTTCCTTGGTGGAGTGGATAGTCTAGTCGACTT GATGAACAAGAGTTTTCCTGAGTTGGGTATTAAAAAAACGGATTGCAGACAATTGAGCTGGATT GATACTATCATCTTCTATAGTGGTGTTGTAAATTACGACACTGATAATTTTAACAAGGAAATTT TGCTTGATAGATCCGCTGGGCAGAACGGTGCTTTCAAGATTAAGTTAGACTACGTTAAGAAACC AATTCCAGAATCTGTATTTGTCCAAATTTTGGAAAAATTATATGAAGAAGATATAGGAGCTGGG ATGTATGCGTTGTACCCTTACGGTGGTATAATGGATGAGATTTCAGAATCAGCAATTCCATTCC CTCATCGAGCTGGAATCTTGTATGAGTTATGGTACATATGTAGTTGGGAGAAGCAAGAAGATAA CGAAAAGCATCTAAACTGGATTAGAAATATTTATAACTTCATGACTCCTTATGTGTCCAAAAAT CCAAGATTGGCATATCTCAATTATAGAGACCTTGATATAGGAATAAATGATCCCAAGAATCCAA ATAATTACACACAAGCACGTATTTGGGGTGAGAAGTATTTTGGTAAAAATTTTGACAGGCTAGT AAAAGTGAAAACCCTGGTTGATCCCAATAACTTTTTTAGAAACGAACAAAGCATCCCACCTCTA CCACGGCATCGTCATTAA SEQ ID NO. 23 Amino Acid Cytostolic CBDA synthase (cytCBDAs) Cannabis sativa MNPRENFLKCFSQYIPNNATNLKLVYTQNNPLYMSVLNSTIHNLRFTSDTTPKPLVIVTPSHVS HIQGTILCSKKVGLQIRTRSGGHDSEGMSYISQVPFVIVDLRNMRSIKIDVHSQTAWVEAGATL GEVYYWVNEKNENLSLAAGYCPTVCAGGHFGGGGYGPLMRNYGLAADNIIDAHLVNVHGKVLDR KSMGEDLFWALRGGGAESFGIIVAWKIRLVAVPKSTMFSVKKIMEIHELVKLVNKWQNIAYKYD KDLLLMTHFITRNITDNQGKNKTAIHTYFSSVFLGGVDSLVDLMNKSFPELGIKKTDCRQLSWI DTIIFYSGVVNYDTDNFNKEILLDRSAGQNGAFKIKLDYVKKPIPESVFVQILEKLYEEDIGAG MYALYPYGGIMDEISESAIPFPHRAGILYELWYICSWEKQEDNEKHLNWIRNIYNFMTPYVSKN PRLAYLNYRDLDIGINDPKNPNNYTQARIWGEKYFGKNFDRLVKVKTLVDPNNFFRNEQSIPPL PRHRH SEQ ID NO. 24 DNA Cytostolic-targeted UDP glycosyltransferase 76G1 (cytUTG) Stevia rebaudiana ATGGAAAATAAAACCGAAACCACCGTCCGCCGTCGTCGCCGTATCATTCTGTTCCCGGTCCCGT TCCAGGGCCACATCAACCCGATTCTGCAACTGGCGAACGTGCTGTATTCGAAAGGTTTCAGCAT CACCATCTTCCATACGAACTTCAACAAGCCGAAGACCAGCAATTACCCGCACTTTACGTTCCGT TTTATTCTGGATAACGACCCGCAGGATGAACGCATCTCTAATCTGCCGACCCACGGCCCGCTGG CGGGTATGCGTATTCCGATTATCAACGAACACGGCGCAGATGAACTGCGTCGCGAACTGGAACT GCTGATGCTGGCCAGCGAAGAAGATGAAGAAGTTTCTTGCCTGATCACCGACGCACTGTGGTAT TTTGCCCAGTCTGTTGCAGATAGTCTGAACCTGCGTCGCCTGGTCCTGATGACCAGCAGCCTGT TCAATTTTCATGCCCACGTTAGTCTGCCGCAGTTCGATGAACTGGGTTATCTGGACCCGGATGA CAAAACCCGCCTGGAAGAACAGGCGAGCGGCTTTCCGATGCTGAAAGTCAAGGATATTAAGTCA GCGTACTCGAACTGGCAGATTCTGAAAGAAATCCTGGGTAAAATGATTAAGCAAACCAAAGCAA GTTCCGGCGTCATCTGGAATAGTTTCAAAGAACTGGAAGAATCCGAACTGGAAACGGTGATTCG TGAAATCCCGGCTCCGAGTTTTCTGATTCCGCTGCCGAAGCATCTGACCGCGAGCAGCAGCAGC CTGCTGGATCACGACCGCACGGTGTTTCAGTGGCTGGATCAGCAACCGCCGAGTTCCGTGCTGT ATGTTAGCTTCGGTAGTACCTCGGAAGTGGATGAAAAGGACTTTCTGGAAATCGCTCGTGGCCT GGTTGATAGCAAACAATCTTTCCTGTGGGTGGTTCGCCCGGGTTTTGTGAAGGGCTCTACGTGG GTTGAACCGCTGCCGGACGGCTTCCTGGGTGAACGTGGCCGCATTGTCAAATGGGTGCCGCAGC AAGAAGTGCTGGCGCATGGCGCGATTGGCGCGTTTTGGACCCACTCCGGTTGGAACTCAACGCT GGAATCGGTTTGTGAAGGTGTCCCGATGATTTTCTCAGATTTTGGCCTGGACCAGCCGCTGAAT GCACGTTATATGTCGGATGTTCTGAAAGTCGGTGTGTACCTGGAAAACGGTTGGGAACGCGGCG AAATTGCGAATGCCATCCGTCGCGTTATGGTCGATGAAGAAGGCGAATACATTCGTCAGAATGC TCGCGTCCTGAAACAAAAGGCGGACGTGAGCCTGATGAAAGGCGGTTCATCGTATGAAAGTCTG GAATCCCTGGTTTCATACATCAGCTCTCTGTAA SEQ ID NO. 25 Amino Acid Cytostolic-targeted UDP glycosyltransferase 76G1 (cytUTG) Stevia rebaudiana MENKTETTVRRRRRIILFPVPFQGHINPILQLANVLYSKGFSITIFHTNFNKPKTSNYPHFTFR FILDNDPQDERISNLPTHGPLAGMRIPIINEHGADELRRELELLMLASEEDEEVSCLITDALWY FAQSVADSLNLRRLVLMTSSLFNFHAHVSLPQFDELGYLDPDDKTRLEEQASGFPMLKVKDIKS AYSNWQILKEILGKMIKQTKASSGVIWNSFKELEESELETVIREIPAPSFLIPLPKHLTASSSS LLDHDRTVFQWLDQQPPSSVLYVSFGSTSEVDEKDFLEIARGLVDSKQSFLWVVRPGFVKGSTW VEPLPDGFLGERGRIVKWVPQQEVLAHGAIGAFWTHSGWNSTLESVCEGVPMIFSDFGLDQPLN ARYMSDVLKVGVYLENGWERGEIANAIRRVMVDEEGEYIRQNARVLKQKADVSLMKGGSSYESL ESLVSYISSL SEQ ID NO. 26 Amino Acid Glycosyltransferase (NtGT5a) Nicotiana tabacum MGSIGAELTKPHAVCIPYPAQGHINPMLKLAKILHHKGFHITFVNTEFNHRRLLKSRGPDSLKG LSSFRFETIPDGLPPCEADATQDIPSLCESTTNTCLAPFRDLLAKLNDTNTSNVPPVSCIVSDG VMSFTLAAAQELGVPEVLFWTTSACGFLGYMHYCKVIEKGYAPLKDASDLTNGYLETTLDFIPG MKDVRLRDLPSFLRTTNPDEFMIKFVLQETERARKASAIILNTFETLEAEVLESLRNLLPPVYP IGPLHFLVKHVDDENLKGLRSSLWKEEPECIQWLDTKEPNSVVYVNFGSITVMTPNQLIEFAWG LANSQQTFLWIIRPDIVSGDASILPPEFVEETKNRGMLASWCSQEEVLSHPAIVGFLTHSGWNS TLESISSGVPMICWPFFAEQQTNCWFSVTKWDVGMEIDSDVKRDEVESLVRELMVGGKGKKMKK KAMEWKELAEASAKEHSGSSYVNIEKLVNDILLSSKH SEQ ID NO. 27 DNA Glycosyltransferase (NtGT5a) Nicotiana tabacum ATGGGTTCCATTGGTGCTGAATTAACAAAGCCACATGCAGTTTGCATACCATATCCCGCCCAAG GCCATATTAACCCCATGTTAAAGCTAGCCAAAATCCTTCATCACAAAGGCTTTCACATCACTTT TGTCAATACTGAATTTAACCACCGACGTCTCCTTAAATCTCGTGGCCCTGATTCTCTCAAGGGT CTTTCTTCTTTCCGTTTTGAGACCATTCCTGATGGACTTCCGCCATGTGAGGCAGATGCCACAC AAGATATACCTTCTTTGTGTGAATCTACAACCAATACTTGCTTGGCTCCTTTTAGGGATCTTCT TGCGAAACTCAATGATACTAACACATCTAACGTGCCACCCGTTTCGTGCATCGTCTCGGATGGT GTCATGAGCTTCACCTTAGCCGCTGCACAAGAATTGGGAGTCCCTGAAGTTCTGTTTTGGACCA CTAGTGCTTGTGGTTTCTTAGGTTACATGCATTACTGCAAGGTTATTGAAAAAGGATATGCTCC ACTTAAAGATGCGAGTGACTTGACAAATGGATACCTAGAGACAACATTGGATTTTATACCAGGC ATGAAAGACGTACGTTTAAGGGATCTTCCAAGTTTCTTGAGAACTACAAATCCAGATGAATTCA TGATCAAATTTGTCCTCCAAGAAACAGAGAGAGCAAGAAAGGCTTCTGCAATTATCCTCAACAC ATTTGAAACACTAGAGGCTGAAGTTCTTGAATCGCTCCGAAATCTTCTTCCTCCAGTCTACCCC ATAGGGCCCTTGCATTTTCTAGTGAAACATGTTGATGATGAGAATTTGAAGGGACTTAGATCCA GCCTTTGGAAAGAGGAACCAGAGTGTATACAATGGCTTGATACCAAAGAACCAAATTCTGTTGT TTATGTTAACTTTGGAAGCATTACTGTTATGACTCCTAATCAGCTTATTGAGTTTGCTTGGGGA CTTGCAAACAGCCAGCAAACATTCTTATGGATCATAAGACCTGATATTGTTTCAGGTGATGCAT CGATTCTTCCACCCGAATTCGTGGAAGAAACGAAGAACAGAGGTATGCTTGCTAGTTGGTGTTC ACAAGAAGAAGTACTTAGTCACCCTGCAATAGTAGGATTCTTGACTCACAGTGGATGGAATTCG ACACTCGAAAGTATAAGCAGTGGGGTGCCTATGATTTGCTGGCCATTTTTCGCTGAACAGCAAA CAAATTGTTGGTTTTCCGTCACTAAATGGGATGTTGGAATGGAGATTGACAGTGATGTGAAGAG AGATGAAGTGGAAAGCCTTGTAAGGGAATTGATGGTTGGGGGAAAAGGCAAAAAGATGAAGAAA AAGGCAATGGAATGGAAGGAATTGGCTGAAGCATCTGCTAAAGAACATTCAGGGTCATCTTATG TGAACATTGAAAAGTTGGTCAATGATATTCTTCTTTCATCCAAACATTAA SEQ ID NO. 28 Amino Acid Glycosyltransferase (NtGT5b) Nicotiana tabacum MGSIGAEFTKPHAVCIPYPAQGHINPMLKLAKILHHKGFHITFVNTEFNHRRLLKSRGPDSLKG LSSFRFETIPDGLPPCDADATQDIPSLCESTTNTCLGPFRDLLAKLNDTNTSNVPPVSCIISDG VMSFTLAAAQELGVPEVLFWTTSACGFLGYMHYYKVIEKGYAPLKDASDLTNGYLETTLDFIPC MKDVRLRDLPSFLRTTNPDEFMIKFVLQETERARKASAIILNTYETLEAEVLESLRNLLPPVYP IGPLHFLVKHVDDENLKGLRSSLWKEEPECIQWLDTKEPNSVVYVNFGSITVMTPNQLIEFAWG LANSQQSFLWIIRPDIVSGDASILPPEFVEETKKRGMLASWCSQEEVLSHPAIGGFLTHSGWNS TLESISSGVPMICWPFFAEQQTNCWFSVTKWDVGMEIDCDVKRDEVESLVRELMVGGKGKKMKK KAMEWKELAEASAKEHSGSSYVNIEKVVNDILLSSKH SEQ ID NO. 29 DNA Glycosyltransferase (NtGT5b) Nicotiana tabacum ATGGGTTCCATTGGTGCTGAATTTACAAAGCCACATGCAGTTTGCATACCATATCCCGCCCAAG GCCATATTAACCCCATGTTAAAGCTAGCCAAAATCCTTCATCACAAAGGCTTTCACATCACTTT TGTCAATACTGAATTTAACCACAGACGTCTGCTTAAATCTCGTGGCCCTGATTCTCTCAAGGGT CTTTCTTCTTTCCGTTTTGAGACAATTCCTGATGGACTTCCGCCATGTGATGCAGATGCCACAC AAGATATACCTTCTTTGTGTGAATCTACAACCAATACTTGCTTGGGTCCTTTTAGGGATCTTCT TGCGAAACTCAATGATACTAACACATCTAACGTGCCACCCGTTTCGTGCATCATCTCAGATGGT GTCATGAGCTTCACCTTAGCCGCTGCACAAGAATTGGGAGTCCCTGAAGTTCTGTTTTGGACCA CTAGTGCTTGTGGTTTCTTAGGTTACATGCATTATTACAAGGTTATTGAAAAAGGATACGCTCC ACTTAAAGATGCGAGTGACTTGACAAATGGATACCTAGAGACAACATTGGATTTTATACCATGC ATGAAAGACGTACGTTTAAGGGATCTTCCAAGTTTCTTGAGAACTACAAATCCAGATGAATTCA TGATCAAATTTGTCCTCCAAGAAACAGAGAGAGCAAGAAAGGCTTCTGCAATTATCCTCAACAC ATATGAAACACTAGAGGCTGAAGTTCTTGAATCGCTCCGAAATCTTCTTCCTCCAGTCTACCCC ATTGGGCCCTTGCATTTTCTAGTGAAACATGTTGATGATGAGAATTTGAAGGGACTTAGATCCA GCCTTTGGAAAGAGGAACCAGAGTGTATACAATGGCTTGATACCAAAGAACCAAATTCTGTTGT TTATGTTAACTTTGGAAGCATTACTGTTATGACTCCTAATCAACTTATTGAATTTGCTTGGGGA CTTGCAAACAGCCAACAATCATTCTTATGGATCATAAGACCTGATATTGTTTCAGGTGATGCAT CGATTCTTCCCCCCGAATTCGTGGAAGAAACGAAGAAGAGAGGTATGCTTGCTAGTTGGTGTTC ACAAGAAGAAGTACTTAGTCACCCTGCAATAGGAGGATTCTTGACTCACAGTGGATGGAATTCG ACACTCGAAAGTATAAGCAGTGGGGTGCCTATGATTTGCTGGCCATTTTTCGCTGAACAGCAAA CAAATTGTTGGTTTTCCGTCACTAAATGGGATGTTGGAATGGAGATTGACTGTGATGTGAAGAG GGATGAAGTGGAAAGCCTTGTAAGGGAATTGATGGTTGGGGGAAAAGGCAAAAAGATGAAGAAA AAGGCAATGGAATGGAAGGAATTGGCTGAAGCATCTGCTAAAGAACATTCAGGGTCATCTTATG TGAACATTGAGAAGGTGGTCAATGATATTCTTCTTTCGTCCAAACATTAA SEQ ID NO. 30 Amino Acid UDP-glycosyltransferase 73C3 (NtGT4) Nicotiana tabacum MATQVHKLHFILFPLMAPGHMIPMIDIAKLLANRGVITTIITTPVNANRFSSTITRAIKSGLRI QILTLKFPSVEVGLPEGCENIDMLPSLDLASKFFAAISMLKQQVENLLEGINPSPSCVISDMGF PWTTQIAQNFNIPRIVFHGTCCFSLLCSYKILSSNILENITSDSEYFVVPDLPDRVELTKAQVS GSTKNTTSVSSSVLKEVTEQIRLAEESSYGVIVNSFEELEQVYEKEYRKARGKKVWCVGPVSLC NKEIEDLVTRGNKTAIDNQDCLKWLDNFETESVVYASLGSLSRLTLLQMVELGLGLEESNRPFV WVLGGGDKLNDLEKWILENGFEQRIKERGVLIRGWAPQVLILSHPAIGGVLTHCGWNSTLEGIS AGLPMVTWPLFAEQFCNEKLVVQVLKIGVSLGVKVPVKWGDEENVGVLVKKDDVKKALDKLMDE GEEGQVRRTKAKELGELAKKAFGEGGSSYVNLTSLIEDIIEQQNHKEK SEQ ID NO. 31 DNA UDP-glycosyltransferase 73C3 (NtGT4) Nicotiana tabacum ATGGCAACTCAAGTGCACAAACTTCATTTCATACTATTCCCTTTAATGGCTCCAGGCCACATGA TTCCTATGATAGACATAGCTAAACTTCTAGCAAATCGCGGTGTCATTACCACTATCATCACCAC TCCAGTAAACGCCAATCGTTTCAGTTCAACAATTACTCGTGCCATAAAATCCGGTCTAAGAATC CAAATTCTTACACTCAAATTTCCAAGTGTAGAAGTAGGATTACCAGAAGGTTGCGAAAATATTG ACATGCTTCCTTCTCTTGACTTGGCTTCAAAGTTTTTTGCTGCAATTAGTATGCTGAAACAACA AGTTGAAAATCTCTTAGAAGGAATAAATCCAAGTCCAAGTTGTGTTATTTCAGATATGGGATTT CCTTGGACTACTCAAATTGCACAAAATTTTAATATCCCAAGAATTGTTTTTCATGGTACTTGTT GTTTCTCACTTTTATGTTCCTATAAAATACTTTCCTCCAACATTCTTGAAAATATAACCTCAGA TTCAGAGTATTTTGTTGTTCCTGATTTACCCGATAGAGTTGAACTAACGAAAGCTCAGGTTTCA GGATCGACGAAAAATACTACTTCTGTTAGTTCTTCTGTATTGAAAGAAGTTACTGAGCAAATCA GATTAGCCGAGGAATCATCATATGGTGTAATTGTTAATAGTTTTGAGGAGTTGGAGCAAGTGTA TGAGAAAGAATATAGGAAAGCTAGAGGGAAAAAAGTTTGGTGTGTTGGTCCTGTTTCTTTGTGT AATAAGGAAATTGAAGATTTGGTTACAAGGGGTAATAAAACTGCAATTGATAATCAAGATTGCT TGAAATGGTTAGATAATTTTGAAACAGAATCTGTGGTTTATGCAAGTCTTGGAAGTTTATCTCG TTTGACATTATTGCAAATGGTGGAACTTGGTCTTGGTTTAGAAGAGTCAAATAGGCCTTTTGTA TGGGTATTAGGAGGAGGTGATAAATTAAATGATTTAGAGAAATGGATTCTTGAGAATGGATTTG AGCAAAGAATTAAAGAAAGAGGAGTTTTGATTAGAGGATGGGCTCCTCAAGTGCTTATACTTTC ACACCCTGCAATTGGTGGAGTATTGACTCATTGCGGATGGAATTCTACATTGGAAGGTATTTCA GCAGGATTACCAATGGTAACATGGCCACTATTTGCTGAGCAATTTTGCAATGAGAAGTTAGTAG TCCAAGTGCTAAAAATTGGAGTGAGCCTAGGTGTGAAGGTGCCTGTCAAATGGGGAGATGAGGA AAATGTTGGAGTTTTGGTAAAAAAGGATGATGTTAAGAAAGCATTAGACAAACTAATGGATGAA GGAGAAGAAGGACAAGTAAGAAGAACAAAAGCAAAAGAGTTAGGAGAATTGGCTAAAAAGGCAT TTGGAGAAGGTGGTTCTTCTTATGTTAACTTAACATCTCTGATTGAAGACATCATTGAGCAACA AAATCACAAGGAAAAATAG SEQ ID NO. 32 Amino Acid Glycosyltransferase (NtGT1b) Nicotiana tabacum MKTAELVFIPAPGMGHLVPTVEVAKQLVDRHEQLSITVLIMTIPLETNIPSYTKSLSSDYSSRI TLLPLSQPETSVTMSSFNAINFFEYISSYKGRVKDAVSETSFSSSNSVKLAGFVIDMFCTAMID VANEFGIPSYVFYTSSAAMLGLQLHFQSLSIECSPKVHNYVEPESEVLISTYMNPVPVKCLPGI ILVNDESSTMFVNHARRFRETKGIMVNTFTELESHALKALSDDEKIPPIYPVGPILNLENGNED HNQEYDAIMKWLDEKPNSSVVFLCFGSKGSFEEDQVKEIANALESSGYHFLWSLRRPPPKDKLQ FPSEFENPEEVLPEGFFQRTKGRGKVIGWAPQLAILSHPSVGGFVSHCGWNSTLESVRSGVPIA TWPLYAEQQSNAFQLVKDLGMAVEIKMDYREDFNTRNPPLVKAEEIEDGIRKLMDSENKIRAKV TEMKDKSRAALLEGGSSYVALGHFVETVMKN SEQ ID NO. 33 DNA Glycosyltransferase (NtGT1b) Nicotiana tabacum ATGAAGACAGCAGAGTTAGTATTCATTCCTGCTCCTGGGATGGGTCACCTTGTACCAACTGTGG AGGTGGCAAAGCAACTAGTCGACAGACACGAGCAGCTTTCGATCACAGTTCTAATCATGACAAT TCCTTTGGAAACAAATATTCCATCATATACTAAATCACTGTCCTCAGACTACAGTTCTCGTATA ACGCTGCTTCCACTCTCTCAACCTGAGACCTCTGTTACTATGAGCAGTTTTAATGCCATCAATT TTTTTGAGTACATCTCCAGCTACAAGGGTCGTGTCAAAGATGCTGTTAGTGAAACCTCCTTTAG TTCGTCAAATTCTGTGAAACTTGCAGGATTTGTAATAGACATGTTCTGCACTGCGATGATTGAT GTAGCGAACGAGTTTGGAATCCCAAGTTATGTGTTCTACACTTCTAGTGCAGCTATGCTTGGAC TACAACTGCATTTTCAAAGTCTTAGCATTGAATGCAGTCCGAAAGTTCATAACTACGTTGAACC TGAATCAGAAGTTCTGATCTCAACTTACATGAATCCGGTTCCAGTCAAATGTTTGCCCGGAATT ATACTAGTAAATGATGAAAGTAGCACCATGTTTGTCAATCATGCACGAAGATTCAGGGAGACGA AAGGAATTATGGTGAACACGTTCACTGAGCTTGAATCACACGCTTTGAAAGCCCTTTCCGATGA TGAAAAAATCCCACCAATCTACCCAGTTGGACCTATACTTAACCTTGAAAATGGGAATGAAGAT CACAATCAAGAATATGATGCGATTATGAAGTGGCTTGACGAGAAGCCTAATTCATCAGTGGTGT TCTTATGCTTTGGAAGCAAGGGGTCTTTCGAAGAAGATCAGGTGAAGGAAATAGCAAATGCTCT AGAGAGCAGTGGCTACCACTTCTTGTGGTCGCTAAGGCGACCGCCACCAAAAGACAAGCTACAA TTCCCAAGCGAATTCGAGAATCCAGAGGAAGTCTTACCAGAGGGATTCTTTCAAAGGACTAAAG GAAGAGGAAAGGTGATAGGATGGGCACCCCAGTTGGCTATTTTGTCTCATCCTTCAGTAGGAGG ATTCGTGTCGCATTGTGGGTGGAATTCAACTCTGGAGAGCGTTCGAAGTGGAGTGCCGATAGCA ACATGGCCATTGTATGCAGAGCAACAGAGCAATGCATTTCAACTGGTGAAGGATTTGGGTATGG CAGTAGAGATTAAGATGGATTACAGGGAAGATTTTAATACGAGAAATCCACCACTGGTTAAAGC TGAGGAGATAGAAGATGGAATTAGGAAGCTGATGGATTCAGAGAATAAAATCAGGGCTAAGGTG ACGGAGATGAAGGACAAAAGTAGAGCAGCACTGCTGGAGGGCGGATCATCATATGTAGCTCTTG GGCATTTTGTTGAGACTGTCATGAAAAACTAG SEQ ID NO. 34 Amino Acid Glycosyltransferase (NtGT1a) Nicotiana tabacum MKTTELVFIPAPGMGHLVPTVEVAKQLVDRDEQLSITVLIMTLPLETNIPSYTKSLSSDYSSRI TLLQLSQPETSVSMSSFNAINFFEYISSYKDRVKDAVNETFSSSSSVKLKGFVIDMFCTAMIDV ANEFGIPSYVFYTSNAAMLGLQLHFQSLSIEYSPKVHNYLDPESEVAISTYINPIPVKCLPGII LDNDKSGTMFVNHARRFRETKGIMVNTFAELESHALKALSDDEKIPPIYPVGPILNLGDGNEDH NQEYDMIMKWLDEQPHSSVVFLCFGSKGSFEEDQVKEIANALERSGNRFLWSLRRPPPKDTLQF PSEFENPEEVLPVGFFQRTKGRGKVIGWAPQLAILSHPAVGGFVSHCGWNSTLESVRSGVPIAT WPLYAEQQSNAFQLVKDLGMAVEIKMDYREDFNKTNPPLVKAEEIEDGIRKLMDSENKIRAKVM EMKDKSRAALLEGGSSYVALGHFVETVMKN SEQ ID NO. 35 DNA Glycosyltransferase (NtGT1a) Nicotiana tabacum ATGAAGACAACAGAGTTAGTATTCATTCCTGCTCCTGGCATGGGTCACCTTGTACCCACTGTGG AGGTGGCAAAGCAACTAGTCGACAGAGACGAACAGCTTTCAATCACAGTTCTCATCATGACGCT TCCTTTGGAAACAAATATTCCATCATATACTAAATCACTGTCCTCAGACTACAGTTCTCGTATA ACGCTGCTTCAACTTTCTCAACCTGAGACCTCTGTTAGTATGAGCAGTTTTAATGCCATCAATT TTTTTGAGTACATCTCCAGCTACAAGGATCGTGTCAAAGATGCTGTTAATGAAACCTTTAGTTC GTCAAGTTCTGTGAAACTCAAAGGATTTGTAATAGACATGTTCTGCACTGCGATGATTGATGTG GCGAACGAGTTTGGAATCCCAAGTTATGTCTTCTACACTTCTAATGCAGCTATGCTTGGACTCC AACTCCATTTTCAAAGTCTTAGTATTGAATACAGTCCGAAAGTTCATAATTACCTAGACCCTGA ATCAGAAGTAGCGATCTCAACTTACATTAATCCGATTCCAGTCAAATGTTTGCCCGGGATTATA CTAGACAATGATAAAAGTGGCACCATGTTCGTCAATCATGCACGAAGATTCAGG GAGACGAAAGGAATTATGGTGAACACATTCGCTGAGCTTGAATCACACGCTTTGAAAGCCCTTT CCGATGATGAGAAAATCCCACCAATCTACCCAGTTGGGCCTATACTTAACCTTGGAGATGGGAA TGAAGATCACAATCAAGAATATGATATGATTATGAAGTGGCTCGACGAGCAGCCTCATTCATCA GTGGTGTTCCTATGCTTTGGAAGCAAGGGATCTTTCGAAGAAGATCAAGTGAAGGAAATAGCAA ATGCTCTAGAGAGAAGTGGTAACCGGTTCTTGTGGTCGCTAAGACGACCGCCACCAAAAGACAC GCTACAATTCCCAAGCGAATTCGAGAATCCAGAGGAAGTCTTGCCGGTGGGATTCTTTCAAAGG ACTAAAGGAAGAGGAAAGGTGATAGGATGGGCACCCCAGTTGGCTATTTTGTCTCATCCTGCAG TAGGAGGATTCGTGTCGCATTGTGGGTGGAATTCAACTTTGGAGAGTGTTCGTAGTGGAGTACC GATAGCAACATGGCCATTGTATGCAGAGCAACAGAGCAATGCATTTCAACTGGTGAAGGATTTG GGGATGGCAGTGGAGATTAAGATGGATTACAGGGAAGATTTTAATAAGACAAATCCACCACTGG TTAAAGCTGAGGAGATAGAAGATGGAATTAGGAAGCTGATGGATTCAGAGAATAAAATCAGGGC TAAGGTGATGGAGATGAAGGACAAAAGTAGAGCAGCGTTATTAGAAGGCGGATCATCATATGTA GCTCTCGGGCATTTTGTTGAGACTGTCATGAAAAACTAA SEQ ID NO. 36 Amino Acid Glycosyltransferase (NtGT3) Nicotiana tabacum MKETKKIELVFIPSPGIGHLVSTVEMAKLLIAREEQLSITVLIIQWPNDKKLDSYIQSVANFSS RLKFIRLPQDDSIMQLLKSNIFTTFIASHKPAVRDAVADILKSESNNTLAGIVIDLFCTSMIDV ANEFELPTYVFYTSGAATLGLHYHIQNLRDEFNKDITKYKDEPEEKLSIATYLNPFPAKCLPSV ALDKEGGSTMFLDLAKRFRETKGIMINTFLELESYALNSLSRDKNLPPIYPVGPVLNLNNVEGD NLGSSDQNTMKWLDDQPASSVVFLCFGSGGSFEKHQVKEIAYALESSGCRFLWSLRRPPTEDAR FPSNYENLEEILPEGFLERTKGIGKVIGWAPQLAILSHKSTGGFVSHCGWNSTLESTYFGVPIA TWPMYAEQQANAFQLVKDLRMGVEIKMDYRKDMKVMGKEVIVKAEEIEKAIREIMDSESEIRVK VKEMKEKSRAAQMEGGSSYTSIGGFIQIIMENSQ SEQ ID NO. 37 DNA Glycosyltransferase (NtGT3) Nicotiana tabacum ATGAAAGAAACCAAGAAAATAGAGTTAGTCTTCATTCCTTCACCAGGAATTGGCCATTTAGTAT CCACAGTTGAAATGGCAAAGCTTCTTATAGCTAGAGAAGAGCAGCTATCTATCACAGTCCTCAT CATCCAATGGCCTAACGACAAGAAGCTCGATTCTTATATCCAATCAGTCGCCAATTTCAGCTCG CGTTTGAAATTCATTCGACTCCCTCAGGATGATTCCATTATGCAGCTACTCAAAAGCAACATTT TCACCACGTTTATTGCCAGTCATAAGCCTGCAGTTAGAGATGCTGTTGCTGATATTCTCAAGTC AGAATCAAATAATACGCTAGCAGGTATTGTTATCGACTTGTTCTGCACCTCAATGATAGACGTG GCCAATGAGTTCGAGCTACCAACCTATGTTTTCTACACGTCTGGTGCAGCAACCCTTGGTCTTC ATTATCATATACAGAATCTCAGGGATGAATTTAACAAAGATATTACCAAGTACAAAGACGAACC TGAAGAAAAACTCTCTATAGCAACATATCTCAATCCATTTCCAGCAAAATGTTTGCCGTCTGTA GCCTTAGACAAAGAAGGTGGTTCAACAATGTTTCTTGATCTCGCAAAAAGGTTTCGAGAAACCA AAGGTATTATGATAAACACATTTCTAGAGCTCGAATCCTATGCATTAAACTCGCTCTCACGAGA CAAGAATCTTCCACCTATATACCCTGTCGGACCAGTATTGAACCTTAACAATGTTGAAGGTGAC AACTTAGGTTCATCTGACCAGAATACTATGAAATGGTTAGATGATCAGCCCGCTTCATCTGTAG TGTTCCTTTGTTTTGGTAGTGGTGGAAGCTTTGAAAAACATCAAGTTAAGGAAATAGCCTATGC TCTGGAGAGCAGTGGGTGTCGGTTTTTGTGGTCGTTAAGGCGACCACCAACCGAAGATGCAAGA TTTCCAAGCAACTATGAAAATCTTGAAGAAATTTTGCCAGAAGGATTCTTGGAAAGAACAAAAG GGATTGGAAAAGTGATAGGATGGGCACCTCAGTTGGCGATTTTGTCACATAAATCGACGGGGGG ATTTGTGTCGCACTGTGGATGGAATTCGACTTTGGAAAGTACATATTTTGGAGTGCCAATAGCA ACCTGGCCAATGTACGCGGAGCAACAAGCGAATGCATTTCAATTGGTTAAGGATTTGAGAATGG GAGTTGAGATTAAGATGGATTATAGGAAGGATATGAAAGTGATGGGCAAAGAAGTTATAGTGAA AGCTGAGGAGATTGAGAAAGCAATAAGAGAAATTATGGATTCCGAGAGTGAAATTCGGGTGAAG GTGAAAGAGATGAAGGAGAAGAGCAGAGCAGCACAAATGGAAGGTGGCTCTTCTTACACTTCTA TTGGAGGTTTCATCCAAATTATCATGGAGAATTCTCAATAA SEQ ID NO. 38 Amino Acid Glycosyltransferase (NtGT2) Nicotiana tabacum MVQPHVLLVTFPAQGHINPCLQFAKRLIRMGIEVTFATSVFAHRRMAKTTTSTLSKGLNFAAFS DGYDDGFKADEHDSQHYMSEIKSRGSKTLKDIILKSSDEGRPVTSLVYSLLLPWAAKVAREFHI PCALLWIQPATVLDIYYYYFNGYEDAIKGSTNDPNWCIQLPRLPLLKSQDLPSFLLSSSNEEKY SFALPTFKEQLDTLDVEENPKVLVNTFDALEPKELKAIEKYNLIGIGPLIPSTFLDGKDPLDSS FGGDLFQKSNDYIEWLNSKANSSVVYISFGSLLNLSKNQKEEIAKGLIEIKKPFLWVIRDQENG KGDEKEEKLSCMMELEKQGKIVPWCSQLEVLTHPSIGCFVSHCGWNSTLESLSSGVSVVAFPHW TDQGTNAKLIEDVWKTGVRLKKNEDGVVESEEIKRCIEMVMDGGEKGEEMRRNAQKWKELAREA VKEGGSSEMNLKAFVQEVGKGC SEQ ID NO. 39 DNA Glycosyltransferase (NtGT2) Nicotiana tabacum ATGGTGCAACCCCATGTCCTCTTGGTGACTTTTCCAGCACAAGGCCATATTAATCCATGTCTCC AATTTGCCAAGAGGCTAATTAGAATGGGCATTGAGGTAACTTTTGCCACGAGCGTTTTCGCCCA TCGTCGTATGGCAAAAACTACGACTTCCACTCTATCCAAGGGCTTAAATTTTGCGGCATTCTCT GATGGGTACGACGATGGTTTCAAGGCCGATGAGCATGATTCTCAACATTACATGTCGGAGATAA AAAGTCGCGGTTCTAAAACCCTAAAAGATATCATTTTGAAGAGCTCAGACGAGGGACGTCCTGT GACATCCCTCGTCTATTCTCTTTTGCTTCCATGGGCTGCAAAGGTAGCGCGTGAATTTCACATA CCGTGCGCGTTACTATGGATTCAACCAGCAACTGTGCTAGACATATATTATTATTACTTCAATG GCTATGAGGATGCCATAAAAGGTAGCACCAATGATCCAAATTGGTGTATTCAATTGCCTAGGCT TCCACTACTAAAAAGCCAAGATCTTCCTTCTTTTTTACTTTCTTCTAGTAATGAAGAAAAATAT AGCTTTGCTCTACCAACATTTAAAGAGCAACTTGACACATTAGATGTTGAAGAAAATCCTAAAG TACTTGTGAACACATTTGATGCATTAGAGCCAAAGGAACTCAAAGCTATTGAAAAGTACAATTT AATTGGGATTGGACCATTGATTCCTTCAACATTTTTGGACGGAAAAGACCCTTTGGATTCTTCC TTTGGTGGTGATCTTTTTCAAAAGTCTAATGACTATATTGAATGGTTGAACTCAAAGGCTAACT CATCTGTGGTTTATATCTCATTTGGGAGTCTCTTGAATTTGTCAAAAAATCAAAAGGAGGAGAT TGCAAAAGGGTTGATAGAGATTAAAAAGCCATTCTTGTGGGTAATAAGAGATCAAGAAAATGGT AAGGGAGATGAAAAAGAAGAGAAATTAAGTTGTATGATGGAGTTGGAAAAGCAAGGGAAAATAG TACCATGGTGTTCACAACTTGAAGTCTTAACACATCCATCTATAGGATGTTTCGTGTCACATTG TGGATGGAATTCGACTCTGGAAAGTTTATCGTCAGGCGTGTCAGTAGTGGCATTTCCTCATTGG ACGGATCAAGGGACAAATGCTAAACTAATTGAAGATGTTTGGAAGACAGGTGTAAGGTTGAAAA AGAATGAAGATGGTGTGGTTGAGAGTGAAGAGATAAAAAGGTGCATAGAAATGGTAATGGATGG TGGAGAGAAAGGAGAAGAAATGAGAAGAAATGCTCAAAAATGGAAAGAATTGGCAAGGGAAGCT GTAAAAGAAGGCGGATCTTCGGAAATGAATCTAAAAGCTTTTGTTCAAGAAGTTGGCAAAGGTT GCTGA SEQ ID NO. 40 Amino Acid THCA Synthase Trichome targeting domain Cannabis MNCSAFSFWFVCKIIFFFLSFHIQISIA SEQ ID NO. 41 Amino Acid CBDA Synthase Trichome targeting domain Cannabis MKCSTFSFWFVCKIIFFFFSFNIQTSIA SEQ ID NO. 42 Amino Acid THCA Synthase Cannabis MNCSAFSFWFVCKIIFFFLSFHIQISIANPRENFLKCFSKHIPNNVANPKLVYTQHDQLYMSIL NSTIQNLRFISDTTPKPLVIVTPSNNSHIQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPFV VVDLRNMHSIKIDVHSQTAWVEAGATLGEVYYWINEKNENLSFPGGYCPTVGVGGHFSGGGYGA LMRNYGLAADNIIDAHLVNVDGKVLDRKSMGEDLFWAIRGGGGENFGIIAAWKIKLVDVPSKST IFSVKKNMEIHGLVKLFNKWQNIAYKYDKDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHGG VDSLVDLMNKSFPELGIKKTDCKEFSWIDTTIFYSGVVNFNTANFKKEILLDRSAGKKTAFS1K LDYVKKPIPETAMVKILEKLYEEDVGAGMYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTAS WEKQEDNEKHINWVRSVYNFTTPYVSQNPRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFG KNFNRLVKVKTKVDPNNFFRNEQSIPPLPPHHH SEQ ID NO. 43 Amino Acid MYB8 - orthologue for CAN738 Humulus lupulus MGRAPCCEKVGLKKGRWTSEEDEILTKYIQSNGEGCWRSLPKNAGLLRCGKSCRLRWINYLRAD LKRGNISSEEEDIIIKLHSTLGNRWSLIASHLPGRTDNEIKNYWNSHLSRKIHTFRRCNNTTTH HHHLPNLVTVTKVNLPIPKRKGGRTSRLAMKKNKSSTSNQNSSVIKNDVGSSSSTTTTSVHQRT TTTTPTMDDQQKRQLSRCRLEEKEDQDGASTGTVVMMLGQAAAVGSSCDEDMLGHDQLSFLCCS EEKTTENSMTNLKENGDHEVSGPYDYDHRYEKETSVDEGMLLCFNDIIDSNLLNPNEVLTLSEE SLNLGGALMDTTTSTTTNNNNYSLSYNNNGDCVISDDHDQYWLDDVVGVDFWSWESSTTVTQEQ EQEQEQEQEQEQEQEQEQEHHHQQDQKKNTWDNEKEKMLALLWDSDNSNWELQDNNNYHKCQEI TSDKENAMVAWLLS SEQ ID NO. 44 Amino Acid atMYB12 - orthologue for CAN739 Arabidopsis thaliana MGRAPCCEKVGIKRGRWTAEEDQILSNYIQSNGEGSWRSLPKNAGLKRCGKSCRLRWINYLRSD LKRGNITPEEEELVVKLHSTLGNRWSLIAGHLPGRTDNEIKNYWNSHLSRKLHNFIRKPSISQD VSAVIMTNASSAPPPPQAKRRLGRTSRSAMKPKIHRTKTRKTKKTSAPPEPNADVAGADKEALM VESSGAEAELGRPCDYYGDDCNKNLMSINGDNGVLTFDDDIIDLLLDESDPGHLYTNTTCGGDG ELHNIRDSEGARGFSDTWNQGNLDCLLQSCPSVESFLNYDHQVNDASTDEFIDWDCVWQEGSDN NLWHEKENPDSMVSWLLDGDDEATIGNSNCENFGEPLDHDDESALVAWLLS SEQ ID NO. 45 Amino Acid MYB112 - orthologue for CAN833 Arabidopsis thaliana MNISRTEFANCKTLINHKEEVEEVEKKMEIEIRRGPWTVEEDMKLVSYISLHGEGRWNSLSRSA GLNRTGKSCRLRWLNYLRPDIRRGDISLQEQFIILELHSRWGNRWSKIAQHLPGRTDNEIKNYW RTRVQKHAKLLKCDVNSKQFKDTIKHLWMPRLIERIAATQSVQFTSNHYSPENSSVATATSSTS SSEAVRSSFYGGDQVEFGTLDHMTNGGYWFNGGDTFETLCSFDELNKWLIQ SEQ ID NO. 46 Amino Acid Cytosolic targeted THCA Synthase (ctTHCAs) Cannabis NPRENFLKCFSKHIPNNVANPKLVYTQHDQLYMSILNSTIQNLRFISDTTPKPLVIVTPSNNSH IQATILCSKKVGLQIRTRSGGHDAEGMSYISQVPFVVVDLRNMHSIKIDVHSQTAWVEAGATLG EVYYWINEKNENLSFPGGYCPTVGVGGHFSGGGYGALMRNYGLAADNIIDAHLVNVDGKVLDRK SMGEDLFWAIRGGGGENFGIIAAWKIKLVDVPSKSTIFSVKKNMEIHGLVKLFNKWQNIAYKYD KDLVLMTHFITKNITDNHGKNKTTVHGYFSSIFHGGVDSLVDLMNKSFPELGIKKTDCKEFSWI DTTIFYSGVVNFNTANFKKEILLDRSAGKKTAFSIKLDYVKKPIPETAMVKILEKLYEEDVGAG MYVLYPYGGIMEEISESAIPFPHRAGIMYELWYTASWEKQEDNEKHINWVRSVYNFTTPYVSQN PRLAYLNYRDLDLGKTNHASPNNYTQARIWGEKYFGKNFNRLVKVKTKVDPNNFFRNEQSIPPL PPHHH SEQ ID NO. 47 Amino Acid Trichome targeted Catalase with THCA Synthase Trichome targeting domain Arabidopsis thaliana MNCSAFSFWFVCKIIFFFLSFHIQISIAMDPYKYRPASSYNSPFFTTNSGAPVWNNNSSMTVGP RGLILLEDYHLVEKLANFDRERIPERVVHARGASAKGFFEVTHDISNLTCADFLRAPGVQTPVI VRFSTVIHARGSPETLRDPRGFAVKFYTREGNFDLVGNNFPVFFIRDGMKFPDIVHALKPNPKS HIQENWRILDFFSHHPESLNMFTFLFDDIGIPQDYRHMDGSGVNTYMLINKAGKAHYVKFHWKP TCGVKSLLEEDAIRLGGTNHSHATQDLYDSIAAGNYPEWKLFIQIIDPADEDKFDFDPLDVTKT WPEDILPLQPVGRMVLNKNIDNFFAENEQLAFCPAIIVPGIHYSDDKLLQTRVFSYADTQRHRL GPNYLQLPVNAPKCAHHNNHHEGFMNFMHRDEEVNYFPSRYDQVRHAEKYPTPPAVCSGKRERC IIEKENNFKEPGERYRTFTPERQERFIQRWIDALSDPRITHEIRSIWISYWSQADKSLGQKLAS RLNVRPSI SEO ID NO. 48 Amino Acid Trichome targeted Catalase with CBDA Synthase Trichome targeting domain Arabidopsis thaliana MKCSTFSFWFVCKIIFFFFSFNIQTSIAMDPYKYRPASSYNSPFFTTNSGAPVWNNNSSMTVGP RGLILLEDYHLVEKLANFDRERIPERVVHARGASAKGFFEVTHDISNLTCADFLRAPGVQTPVI VRFSTVIHARGSPETLRDPRGFAVKFYTREGNFDLVGNNFPVFFIRDGMKFPDIVHALKPNPKS HIQENWRILDFFSHHPESLNMFTFLFDDIGIPQDYRHMDGSGVNTYMLINKAGKAHYVKFHWKP TCGVKSLLEEDAIRLGGTNHSHATQDLYDSIAAGNYPEWKLFIQIIDPADEDKFDFDPLDVTKT WPEDILPLQPVGRMVLNKNIDNFFAENEQLAFCPAIIVPGIHYSDDKLLQTRVFSYADTQRHRL GPNYLQLPVNAPKCAHHNNHHEGFMNFMHRDEEVNYFPSRYDQVRHAEKYPTPPAVCSGKRERC IIEKENNFKEPGERYRTFTPERQERFIQRWIDALSDPRITHEIRSIWISYWSQADKSLGQKLAS RLNVRPSI SEQ ID NO. 49 Amino Acid Catalase HPII (KatE) with THCA Synthase Trichome targeting domain Escherichia coli MNCSAFSFWFVCKIIFFFLSFHIQISIAMSQHNEKNPHQHQSPLHDSSEAKPGMDSLAPEDGSH RPAAEPTPPGAQPTAPGSLKAPDTRNEKLNSLEDVRKGSENYALTTNQGVRIADDQNSLRAGSR GPTLLEDFILREKITHFDHERIPERIVHARGSAAHGYFQPYKSLSDITKADFLSDPNKITPVFV RFSTVQGGAGSADTVRDIRGFATKFYTEEGIFDLVGNNTPIFFIQDAHKFPDFVHAVKPEPHWA IPQGQSAHDTFWDYVSLQPETLHNVMWAMSDRGIPRSYRTMEGFGIHTFRLINAEGKATFVRFH WKPLAGKASLVWDEAQKLTGRDPDFHRRELWEAIEAGDFPEYELGFQLIPEEDEFKFDFDLLDP TKLIPEELVPVQRVGKMVLNRNPDNFFAENEQAAFHPGHIVPGLDFTNDPLLQGRLFSYTDTQI SRLGGPNFHEIPINRPTCPYHNFQRDGMHRMGIDTNPANYEPNSINDNWPRETPPGPKRGGFES YQERVEGNKVRERSPSFGEYYSHPRLFWLSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLA HIDLTLAQAVAKNLGIELTDDQLNITPPPDVNGLKKDPSLSLYAIPDGDVKGRVVAILLNDEVR SADLLAILKALKAKGVHAKLLYSRMGEVTADDGTVLPIAATFAGAPSLTVDAVIVPCGNIADIA DNGDANYYLMEAYKHLKPIALAGDARKFKATIKIADQGEEGIVEADSADGSFMDELLTLMAAHR VWSRIPKIDKIPA SEQ ID NO. 50 Amino Acid Catalase HPII (KatE) with CBDA Synthase Trichome targeting domain Escherichia coli MKCSTFSFWFVCKIIFFFFSFNIQTSIAMSQHNEKNPHQHQSPLHDSSEAKPGMDSLAPEDGSH RPAAEPTPPGAQPTAPGSLKAPDTRNEKLNSLEDVRKGSENYALTTNQGVRIADDQNSLRAGSR GPTLLEDFILREKITHFDHERIPERIVHARGSAAHGYFQPYKSLSDITKADFLSDPNKITPVFV RFSTVQGGAGSADTVRDIRGFATKFYTEEGIFDLVGNNTPIFFIQDAHKFPDFVHAVKPEPHWA IPQGQSAHDTFWDYVSLQPETLHNVMWAMSDRGIPRSYRTMEGFGIHTFRLINAEGKATFVRFH WKPLAGKASLVWDEAQKLTGRDPDFHRRELWEAIEAGDFPEYELGFQLIPEEDEFKFDFDLLDP TKLIPEELVPVQRVGKMVLNRNPDNFFAENEQAAFHPGHIVPGLDFTNDPLLQGRLFSYTDTQI SRLGGPNFHEIPINRPTCPYHNFQRDGMHRMGIDTNPANYEPNSINDNWPRETPPGPKRGGFES YQERVEGNKVRERSPSFGEYYSHPRLFWLSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLA HIDLTLAQAVAKNLGIELTDDQLNITPPPDVNGLKKDPSLSLYAIPDGDVKGRVVAILLNDEVR SADLLAILKALKAKGVHAKLLYSRMGEVTADDGTVLPIAATFAGAPSLTVDAVIVPCGNIADIA DNGDANYYLMEAYKHLKPIALAGDARKFKATIKIADQGEEGIVEADSADGSFMDELLTLMAAHR VWSRIPKIDKIPA SEQ ID NO. 51 DNA CYP3A4 forward primer Artificial TGCCTAATAAAGCTCCTCCTACT SEQ ID NO. 52 DNA CYP3A4 reverse primer Artificial GCTCCTGAAACAGTTCCATCTC SEQ ID NO. 53 DNA P450 oxidoreductase forward primer Artificial GGAAGAGCTTTGGTTCCTATGT SEQ ID NO. 54 DNA P450 oxidoreductase reverse primer Artificial GCTCCCAATTCAGCAACAATATC SEQ ID NO. 55 DNA CBDA synthase forward primer artificial ACATCACAATCACACAAAACTAACAAAAG SEQ ID NO. 56 DNA CBDA synthase reverse primer Artificial GGCCATAGTTTCTCATCAATGG SEQ ID NO. 57 DNA UGT76G1 forward primer Artificial GATTGGAAGAACAAGCTTCAGGATTTCC SEQ ID NO. 58 DNA UGT76G1 reverse primer Artificial CCATCCTGAATGAGTCCAAAAAGCTC SEQ ID NO. 59 DNA ABCG2 forward primer CCTTCAGGATTGTCAGGAGATG SEQ ID NO. 60 DNA ABCG2 reverse primer Artificial GCAGGTCCATGAAACATCAATC SEQ ID NO. 61 DNA Trichome-targeted CBDA forward primer Artificial AAAGATCAAAAGCAAGTTCTTCACTGT SEQ ID NO. 62 DNA Trichome-targeted CBDA reverse primer Artificial CCATGCAGTTTGGCTATGAACATCT SEQ ID NO. 63 DNA Trichome-targeted UGT forward primer Artificial AGTGCTCAACATTCTCCTTTTGGTT SEQ ID NO. 64 DNA Trichome-targeted UGT reverse primer Artificial TCTGAAGCCAACATCAACAATTCCA SEQ ID NO. 65 DNA Plasma membrane-targeted UTR1 forward primer Artificial TTGTTCCTTAAACCTCGCCTTTGAC SEQ ID NO. 66 DNA Plasma membrane-targeted UTR1 reverse primer Artificial TCATTATGGAGCACTCCACTCTCTG SEQ ID NO. 67 DNA Cytosolic-targeted CBDA forward primer Artificial AAAGATCAAAAGCAAGTTCTTCACTGT SEQ ID NO. 68 DNA Cytosolic-targeted CBDA reverse primer Artificial ATAAACTTCTCCAAGGGTAGCTCCG SEQ ID NO. 69 DNA Cytosolic-targeted UGT forward primer Artificial AGAACTGGAAGAATCCGAACTGGAA SEQ ID NO. 70 DNA Cytosolic-targeted UGT reverse primer Artificial AAATCATCGGGACACCTTCACAAAC 

What is claimed is:
 1. A method for the production of water-soluble cannabinoids in the cytosol of a Cannabis cell comprising expressing in the Cannabis sativa cell a nucleotide sequence, operably linked to a promoter, encoding: i. a cannabinoid synthase endogenous to said Cannabis sativa cell, having its trichome targeting signal disrupted or removed, wherein the cannabinoid synthase is a tetrahydrocannabinolic acid synthase (THCA synthase) polypeptide having the amino acid sequence of SEQ ID NO: 46, or an amino acid sequence having 98% sequence identity with SEQ ID NO: 46, and a cannabidiolic acid synthase (CBDA synthase) polypeptide having the amino acid sequence of SEQ ID NO: 23, or an amino acid sequence having 98% sequence identity with SEQ ID NO: 23; ii. a heterologous glycosyltransferase, selected from the group consisting of: SEQ ID NO: 7, 27, 29, 31, 33, 35, 37, and 39; iii. a myb transcription factor having SEQ ID NO: 11, or wherein the nucleotide sequence encodes a polypeptide having an amino acid sequence selected from the group consisting of: SEQ 12 and 43-45; iv. a heterologous catalase enzyme; v. a heterologous cytochrome P450 according to SEQ ID NO: 1; and vi. a heterologous P450 oxidoreductase according to SEQ ID NO:
 3. 2. The method of claim 1, wherein said catalase is selected from the group consisting of: SEQ ID NO: 13 and
 15. 3. A method for producing water-soluble cannabinoids comprising expressing in a Cannabis sativa plant cell: i. a nucleotide sequence, operably linked to a promoter, encoding a cannabinoid synthase from Cannabis sativa, having its trichome targeting signal disrupted or removed, wherein the cannabinoid synthase is a THCA synthase polypeptide having the amino acid sequence of SEQ ID NO: 46, or an amino acid sequence having 98% sequence identity with SEQ ID NO: 46, and a CBDA synthase polypeptide having the amino acid sequence of SEQ ID NO: 23, or an amino acid sequence having 98% sequence identity with SEQ ID NO: 23; ii. a nucleotide sequence, operably linked to a promoter, encoding a heterologous glycosyltransferase selected from the group consisting of: SEQ ID NO: 7, 27, 29, 31, 33, 35, 37 and 39; iii. a nucleotide sequence, operably linked to a promoter, encoding a myb transcription factor having SEQ ID NO: 11, or wherein the nucleotide sequence encodes a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:12 and 43-45; and iv. a nucleotide sequence, operably linked to a promoter, encoding a heterologous catalase enzyme; and wherein the cannabinoid synthase synthesizes a cannabinoid in the cytosol of said Cannabis sativa cell selected from the group consisting of: cannabidiol (CBD), cannabidiolic acid (CBDA), tetrahydrocannabinolic acid (THCA) and tetrahydrocannabinol (THC), and wherein the cannabinoid is further glycosylated in the cytosol of said Cannabis sativa cell.
 4. A method for the production of water-soluble cannabinoids comprising expressing in a Cannabis cell: i. a nucleotide sequence encoding a CBDA synthase polypeptide from Cannabis sativa having the amino acid sequence of SEQ ID NO: 23, or an amino acid sequence having 98% sequence identity with SEQ ID NO: 23, having its trichome targeting signal disrupted or removed; ii. a nucleotide sequence, operably linked to a promoter, encoding a heterologous glycosyltransferase selected from the group consisting of: SEQ ID NO: 29 and 31; iii. a nucleotide sequence, operably linked to a promoter, encoding a myb transcription factor having SEQ ID NO: 11, or wherein the nucleotide sequence encodes a polypeptide having an amino acid sequence selected from the group consisting of: SEQ ID NO: 12 and 43-45; and iv. a nucleotide sequence, operably linked to a promoter, encoding a heterologous catalase enzyme; wherein said CBDA synthase synthesizes CBDA in the cytosol of said Cannabis cell, and wherein said CBDA is further glycosylated in the cytosol of said Cannabis cell forming a CBDA glycoside or a CBD glycoside.
 5. The method of claim 4, further comprising the step of expressing a nucleotide sequence according to SEQ ID NO: 1, operably linked to a promoter, encoding a heterologous cytochrome P450.
 6. The method of claim 4, further comprising the step of expressing a nucleotide sequence according to SEQ ID NO: 3, operably linked to a promoter, encoding a heterologous P450 oxidoreductase.
 7. The method of claim 4, wherein said catalase enzyme is selected from the group consisting of: the nucleotide sequence of SEQ ID NO: 13 and
 15. 